1 CJGCCOGOiQG CTCAGCrCGG CGATCTCQQC CCCAGCGAGG CQGraOOGGG 
51 GGG0QQQGC3G GGGC3QGGGCG CGO^GCAQGA GCGAGTTQQQG CCQCCOQCCIG 
101 QGCCACX5GAC ACTCTCGCCC QGCTGCCCAGG TTCCCAA0\A QGCTAGQCAG 
151 AAGAACCXICC TTCACTGAAG OXATQGAQQG QQGrCX:>VQCr GTCTQCreCC 
201 AQGATCCTQG GGOVGAGCTC GTAGAAQSGG TCQOVaXAT GGATCTCACT 
251 CACTTOGAQG AQGCAGATCG TQGCCCAGAG CCTACTAGAA AOsGTUreGA 
301 CCCCCCACOV CGQGCCAGAG CreCCTCTCT GATCCCTOGC AGTACTTOVA 
351 GACTQCTCCC AGCCCXSGCCT AGCCTCTCAG COVQGAAGCT TTCCCTACAG 
401 GAGCQGCO^ CAQGAAGCTA TCTCGAQQGG CAQQCTQQQC CTTATQCONC 
451 GQGQCCFGCC AGCCAOVTCT COCCCOQQQC CTCQCX3GAQG (XCACCATOG 
501 AGTCCCACOV CGTX3GCCATC TCAGATGCAG AQGACTX3CGT GCAGCTCAAC 
551 CAGTACAAQC TGCAGAGTGA GATTQGCAAG GGTGCCTACG GTGreGTGAG 
601 GCTCGCCTAC AACGAAAGTG AAGACAGACA CTATGCAATG AAASTCCTTT 
651 GOVAAAAGAA GrTACTGAAG CfiGTKTCOCV TTCCAajTOG CCCrGGGODG 
701 AGAGGGTGCr AGGCTGGCCA QQGAQGAOCA GCCAAQCAGC TGCTCGCCCT 
751 QGAGGGGGTC TACCAQGAGA TTCCCATCCT GAAGAAQCTG GACCAGGTGA 
801 ATGrecrCAA ACIWOGAG GTCCTGGATG ACCCAGCIGA GGACAAGCTC 
851 TATTTCGTCr TTGACCTCCr GAGAAAQQQG CCOGTCATCG AAGTGCCCTG 
901 TGACAAQCCC TTCTX23GAQG AQCAAfiCTCG CCTUTAGCrc GGOGAaJTCA 
951 TCCTOGQCCr CGAGTACrnG CACTCCCAGA AGATOGTCCA CAQGGAOVTC 
1001 AAGCCATCO^ ACCTGCTXZCT GQGQGATGAT GQQCACGrGA AGATCQCCGA 
1051 CrmOGCGTC AGCAACCAGT TTGAQGGGAA GGACGCTCAG CTGrCCAQCA 
1101 CQGGQQGAAC CCCAGCATTC ATQGCCCCCG AQQCCATTTC TGATTCCGQC 
1151 OVGAQCTTO^ GTCQGAAQGC CTTCGArGTA TQQQCCACre GGGrCAOGTT 
1201 GTALIGLI 1 1 GTCTATQGGA AGTCCCCATT OKTOGACGAT TTCATCCTX3G 
1251 CCCrCCACAG GAAGATCAAG AATGAGCCCG TOGTCTTTCC TGAQGAQCCA 
1301 GAAATCAQCG AQGAGCTCAA QGACCTGATC CFGMGATGr TAGACAAGAA 
1351 TGCGGAGAGG AGAATTQGGG TGCCAGACAT CAAGTrGCAC CCTTOQGrGA 
1401 CCAAGAACQG GGAQGAGCCC CTTCOTGOG AGGAGGAGOV CTQCAGCGTG 
1451 GTGGAGGTGA CAGAQGQGGA GGTTAAGAAC TCAGTCAQGC TCATCCGCAG 
1501 CTGGACCAC3G GTGATCCTQG IGfiifiGTCCXT GCPGAGGAAG OGirCCTTTC 
1551 GGAACCGGTT TGAQCCCOVG GO^CXSGAQGG AAGAQGGATC CATOTCTCCT 
1601 CCAGGAAAGC TACTGGTGAA AGAAGGGITT GGTGAAGGGG GCAAGAGGCC 
1651 AGAGCrCCCC GGCGTCCAQG AAGACGAQGC TGOVTCCTGA GCCCCTGCAT 
1701 GCACCCAGQG CCACCCQQCA GCACACTCAT CCCGCGCCTC CAGAGGCCCA 
1751 CCCCTCATQC AACAGCOQCC CCCGCAGQCA GQQQGCTCGG GACTGOVQCC 
1801 CCACrCCGGC CCCrcCCCCA TC3GTCCrGCA TGA(XrCCAC GCAaXAC3GT 
1851 CCAQQGACAG ACTOGAATGT ATGTOVTTPG GGGKTTGGG GGCAGGGCTC 
1901 CCACGAGGCC ATCCTCCrCT TCTTQGCCCT CCTTCGCCTG ACCCAT70G 
1951 TQQQGAAACC GQGTGCCCAT GGAGCCTOVG AAA7GCCACC CGG LIUj I IG 
2001 GCATGGCCTG GGGCAGGAQG CAGAGGCAGG agaco\agat GGCAGGTCGA 
2051 QGCCAQGCTT ACCACAAOQG AAGAGACXTC CCQCTOGGGC CQOGOVOQCC 
2101 TOGCrOVQCr GCCACAGGCA TATQGnSGAG AQQQQQGTAC CCTQCCCAOC 
2151 TTQQQGFQGT GGCAGCAGAG CTLI IGILIA TTCAGAQQCT (SEQ ID N0:1) 

FEATURES: 

5'um: 1 - 172 

Start Oodon: 173 

Stop ccidon: 1688 

3'IJTR: 1691 
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Homologous proteins: 

Top BLAST Hits 



91 

gi 

91 
91 
91 
91 

gi 

91 
91 
91 
91 
91 
91 
91 

gi 



1836161 
2143629 
9256525 
3882295 
4877951 
7446417 
5729895 
7446362 
7289880 
3859986 
1711543 
5053103 
6320976 
6911862 
6321259 



gb I AAB46910.il (S83194) Ca2+/calnK)du1 in-dependent pr.. 
pir||A57156 <:a2+/ca1modul in-dependent protein kinase., 
ref |NP_061371.1| calciim calmodulin dependent protei.. 
dbi I BAA34507.il (AB018330) KIAA0787 protein [Homo sa.. 
gblAAD31507.1|AFl40507JL CAF140507) Ca2+/calmodulin-. . 
■|JC5669 Ca2+/calmodulin-dependent protein kinase.. 
NP_006540.1i calciLm/calmodulin-dependent protei.. 
1x37317 probable Ca2+/calmodul in-dependent prote.. 
AAF45480.il (AE002612) OG17698 gene product [Dros.. 
AAC72943.li CAF091074) unknown [Homo sapiens] 
P50526|SSPLJSCHPO SERINE/THREDNINE-PROTEIN KINASE.. 
AAD38851.1IAF156028JL (AF156028) calciun/calmodul . . 
NP_011O55.1| DMA polymerase alpha suppressing pr.. 
CAB72162.il (AL138649) serine/threonine-protein .. 
NP_011336.1| Ygll79cp >gi|1170647|spjP43637|KGS9.. 



pir 

ref 
pir 

$ 

re 

emb 

ref 



BLAST to dbEST: 

gi 110204347 /dataset=dbest /taxon=96. . . 

EXPRESSION INPORMATEON FOR MODULATORY USE: 
library source: 

Expression information from BLAST dbEST hit: 
gi 1 10204347: Eye (retinoblastoma) 





C. 








n n 


QAfi 


u.u 










. 581 


e-165 


. 577 


e-163 


. 409 


e-U3 


. 343 


3e-93 






. 231 


2e-59 


. 203 


4e-51 


. 199 


7e-50 


. 193 


3e-48 


. 188 


le-46 


555 


e-155 



Expression information from PCR-based tissue screening panels: 
Human Adult Brain 
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1 MEOGPAVCQQ DPRAELVERV AAIDVTHLEE ADQGPEPTON GVDPPPRARA 
51 ASVIPGSTSR LLPARPSLSA RKLSLQERPA GSYLEAQAGP YATGPASHIS 
101 PRAWRRFOE SHHVAISDAE DCVQLNQYKL QSECGKGAYG WRLAYNESE 
151 DRHYANKVLS KKKLLKQYGF PRRPPPRGSQ AAQQGPAKQL LPLERVYQEI 
201 AILKKLDHVN WKLIEVUX) PAEDNLYLVF DLLRKGPVME VPODKPFSEE 
251 QARLYLRDVI LGLEYLHOy IVHRDIKPSN LLLGDOGHVK lADFGVSNQF 
301 EGM5AQLSST AGTPARWE AISDSQCJSFS GKALDVWATG VTLYCFVYGK 
351 CPFIDDFILA LHRKIKNEPV VFPEEPEISE ELKDLILKML DKNPETRIGV 
401 PDIKLHFWr KNGEEPLPSE EEHCSWEVT EGEVKNSVRL IPayfTTVILV 
451 KSMLRKRSFG NPFEPQARRE ERSMSAPGNL LVKEGPGEGG KSPELPGVQE 
501 DEAAS (SEQ ID NO: 2) 



FEATURES: 

Functional domains and key regions: 

[1] PDCCOOOOl PSOOOOl ASNJGLYCOSYLATEON 

N-glycosy1ation site 

147-150 NESE 

[2] PDOC00004 PS00004 CAMP_PHOSPH0LJ5rTE 

CAMP- and cQwiP-dependent protein kinase phosphorylation site 

Niinber of matches: 3 

1 71-74 RKLS 

2 105-108 RRPT 

3 455-458 RKRS 

[3] PDOC300OO5 PS00005 PKCLPHOSPHOLSITE 
Protein kinase c phosphorylation site 

Number of matches: 6 

1 58-60 TSR 

2 69-71 SAR 

3 100-102 SPR 

4 160-162 SKK 

5 330-332 SGK 

6 437-439 SVR 

[4] PDOC00006 PS00006 CK2_PHOSPH0LJ5rTE 

Casein kinase u phosphorylation site 

Number of matches: 7 

1 26-29 -mLE 

2 74-77 SLQE 

3 82-85 SYLE 

4 U7-120 SDAE 

5 419-422 SEEE 

6 425-428 SWE 

7 430-433 TEGE 
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[5] FIXCOOOOS PS00008 MYRISTYL 

N-myristoylation site 

Nunijer of matches: 2 

1 178-183 GSQAAQ 

2 326-331 QQ5FSG 

[6] PDOC00017 PS00017 ATP_GT?J\ 
ATP/GTP-binding site motif A (P-loop) 

485-492 GFGEGGKS 

[7] PDOCOOIDO PS00107 PRQTEINJ<INASE_ATP 

Protein kinases ATP-binding region signature 

134-157 lOCGAYGWRLAYNESEDRHYAMK 

[8] FDOCXWIOO PS00108 PROTEmjaNASEJST 
serine/threonine protein kinases active-site signature 

271-283 nVHRDIKPSNLLL 



Membrane spanning structure and domains: 
Helix Begin End score Certainty 
1 339 359 0.946 Putative 

BLAST Alignment to Top Hit: 

>gi|1836161|gb|AAB46910.1| (S83194) ca2+/calmodul in-dependent 
protein kinase IV kinase isofbrm, CaM-kinase 
alpha [rats, brain, Peptide, 505 aa] [Rattus sp.] 
>gi |4512334|dbj lBAA75246.ll (AB023658) 
Ca/calmodul in-dependent protein kinase alpha, 
CaM-kinase alpha [Rattus norvegicus] 
Length = 505 

score = 965 bits (2468) , Expect = 0.0 

Identities = 472/505 (9350, Positives = 483/505 (95?0 

Query: 1 MEOGPAVCapPRAELVERVAAmVTWLEEADGGPEFmiGVDPPPRARAASVIPGSTSR 60 

ME PAVCQQPPRAELVERVAAI V HLEEAf GPEP NGVDPPPRARAASVIPGS SR 
Sbjct: 1 MERSPAVCOPPRAELVERVAAISVAHLEEAEEGPEPASNGVDPPPRARAASVIPGSASR 60 

Query: 61 LLPARPSLSARKLSLQERPAGSYLEAQAGPYATGPASHISPRAWRRPTTESHHVAISDAE 120 
P RPSLSARK SLQERPAGS LEAQ GPY+TGPASH+SPRAWRRPTTESHHVAISD E 

sbjct: 61 ptpvrpslsarkfslqerpagscleacmspystcpashmsprawrrpteeshhvaisote 120 

Query: 121 DCVQLNQYKLQSEIGKGAYGVVRLAYNESEDRHYAMlWLSKKiaLK^ 180 

DCMILNQYKLQSEIGKGAYGWRLAYNE EDRHYAMKVLSKKKLLKQYGFPRRPPPRGSQ 
Sbjct: 121 [XVQLNQV1<LQ5ECGKGAYGVVRLAYNEREDRHYAMKVLSI<I^^ 180 

Query: 181 AAQQGPAKQLLPLERVYQEIAILKKLDHVNVVKLIEVLIX)PAEDNLYLVro 240 

A QQGPAKQLLPLERVYQEIAILI<KLDHVNVVKLIEVUX)PAEDNLYLVFCa.LRKGPVME 
Sbjct: 181 APQQGPAKQLLPLERVYQEIAILI<laI3HVNVVKLIEVLIX)PAEDNLYLVFDLLRKGPV^^ 240 
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Query: 241 VPOlKPFSEKyVRLYUCVILGLEYLiCQKI^ 300 

VPCDKPF EEQARLYlJ?DfILGLEYlJ«]QI<IVHRDIKPSNLLLGD(XS^^ 
Sbjct: 241 VPOKPITEEQAI^YIJVDIILGLEYLHIJKIVHRDIK^ 300 

Query: 301 EGhDAQLSSTAGTPAmAPEAISDSGQSFSa<AIJ)VWATt^ 360 

E(MyVQL5STAGrPAFMAPEAISI>H3C5Ra<AIJ3^ 
Sbjct: 301 ECasDAQLSSTAGTTPAFMAPEAISCTOQSFSGK^^ 360 

Query: 361 mWCnCNEPWFPEEPEISEELKDaLKMlJDKNPETra 420 

LHRKIKNE NWFPEEPE+SEELKDLILKMLDKNPETRIGV DIKLHPWVTK+GEEPLPSE 
Sbjct: 361 mwaKNEANA/ITEEPEVSEEUCDLILKMLJDKNPETKIGVSDIK^ 420 

Query: 421 EEH(3WEVTEGEVI<NSVRLIPSWrmLVKSMLJy^ 480 

EEHC5WEVTE EVKhSVHJPSWn\^LVKSI^«J^KRSPGNPFEP(^^ 
Sbjct: 421 EEHCSWEynEEEVKNSVKLEPSIimVILVKSr^^ 480 

Query: 481 LVKEGI=GEQGKSPELPGVQEDEAAS 505 

L^KEG GBGGKSPELPGVQEDEAAS 
Sbjct: 481 LLKEGGGBGGKSPELPGVQEDEAAS 505 (SEQ ID NO: 4) 



Hnner search results (Pfam): 

Model Description 

Eukaryotic protein kinase domain 
CE00022 MAGUlC^ubfanri1y_d 
E00359 bonejnorphogenetiQ_protein_receptor 

CE00031 VBGFR 
CE00203 ERBB_RECEProR 

E00334 urotrDphin_receptor 
CE00292 PTK_membraneL-span 
CE00287 PTKja5K_orphan_receptor 
CE00291 PTK_fgf_receptor 
E00286 PTK_EGF_receptor 
CE00290 FTK_Trk_fanrny 
CE00016 GSK_glycoqen_synthaseJdnase 
CE00288 PTKJ[nsuTin_receptor 



PF00069 
CE00022 
CE00359 
CE00031 
CE00203 
CE00334 
CE00292 
CE00287 
CE00291 
CE00286 
CE00290 
CE00016 
CE00288 



Score 


E-value 


N 


275.1 


8.8e-79 


1 


45.1 


3.2e-13 


1 


26.1 


l.le-06 


1 


13.6 


0.00033 


1 


6.7 


0.16 


1 


6.6 


0.047 


1 


-66.6 


4.1e-05 


1 


-81.5 


0.0049 


1 


-95.1 


0.0027 


1 


•119.5 


0.00094 


1 


132.1 


2.8e-06 


1 


206.4 


3.1e-05 


1 


225.0 


0.18 


1 



Parsed for cJomains: 



Model 


Domain 


sea-f sea-t 


hniTi-f hmm-t 




score 


E-value 


CE00031 


VI 


243 


296 .. 


1039 


1092 




13.6 


0.00033 


CE00334 


VI 


264 


297 .. 


670 


703 




6.6 


0.047 


CE00203 


VI 


262 


307 .. 


852 


897 




6.7 


0.16 


CE00359 


VI 


271 


320 .. 


272 


325 




26.1 


l.le-06 


CE00288 


VI 


179 


382 .. 


1 


269 




-225.0 


0.18 


CE00291 


VI 


128 


382 .. 


1 


285 




-95.1 


0.0027 


CE00290 


VI 


129 


390 .. 


1 


282 




-132.1 


2.8e-06 


CE00286 


VI 


128 


403 .. 


1 


263 




-119.5 


0.00094 


CE00292 


VI 


128 


405 .. 


1 


288 




-66.6 


4.1e-05 


CE00287 


VI 


128 


407 .. 


1 


260 




-81.5 


0.0049 


CE00022 


VI 


247 


409 .. 


118 


283 




45.1 


3.2e-13 


PP00069 


VI 


128 


409 .. 


1 


278 [] 275.1 


8.8e-79 


CE00016 


VI 


64 


481 .. 


1 


433 [] -206.4 


3.1e-05 
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1 C0QCCC3GCQC ATCCATCTOG GCCTCAGCGT GTCCGGAGCA ATCACAACAG 
51 OVGCGGCAOV ACAACAACTC ACmTACGG CCrCOTAGT GGCAQGCACT 
101 Gl ICIGAGC3G CGTACQQGC GTTCCCrCCJ CAQCATCTCA CCACCTQCGG 
151 TGAGGTGAGG CCGQCTAGAA CCCOVTCTTC GGGGGGAGGA AAACCOVAGG 
201 CAONGAGQGG AAQCDVCCTC CTCACXSQQCr COCAQCCAQG AAAQQGTCCA 
251 GCCTQGCTCC CTX3QCTTCAG AGCCTCQGCG CCAAACCGC3G TAACAQQGCT 
301 CAQQCTQGAA OW5GAAACCT TCFQCCCGGA dTGCTlQQGr GACCCOiQGC 
351 CCATCCCCAC CC3GCTQC3GCC TCCCTCTACC TATCTAAGAA AAGCAQQGAA 
401 AGGTCTTCM QQOTAAAQGA QGATCQCCTC TTQCTOGAAT QQCAACCTCA 
451 AGGAAATAGG ONAATTTTAT GQQCOCQQGC AGCJCTCTOQC TTCreGCTCr 
501 GGCQGCTCrc AGTCCCXHAG TCCCTGCCrA QGQCCAAAAA GCAGGAGCTC 
551 CTCACrCTOG AGTTCAnnCT GrTATATGTC CreOGGCCTG AGGLI KiLIG 
601 GGGTTCCCTC TCTCAGGCTC CTTTCTCATC TCTCTAATOG GGACAQQGCT 
651 GTAACXSATCA CTATOQCAAC CACTCATTTA TTCAACAAAT ATTTATCJGAG 
701 TTCCTATCAC ATQCOVQQCA CPGATGATCT TTTOGAGACA AGQO«3ATGA 
751 GCGTCCTAAT CTCATGAAAC TTACATTOQG GAQGGAAAAA CAAQGCATGC 
801 GGAGTCAQQG GAAQQQGGQG AGGGGPOGGC CACCTGCrOG GAQGAQGCnG 
851 GCGGGTCCPG GAQQGTCTTC COVGCTTTCG CTTCCTCCIT CCTATQCrCT 

901 croGnrccA aqctctcccc gaaqctcovg coxAcrcAC TGroccrcrc 

951 ACOCCrCCA GQGAQQCCrc CCTATGCO^C AGCCrCTCAC CTCCrCTCGG 
1001 GAQGCCrccr TATQCCACAG CCCCACTCTC TGrCCrCTCT OVCCTCCTCC 
1051 AGQGAQGCCr CCCTCPGCCA CAGCCCOVCT CCCTGTCCCC TCTCACCTCC 
1101 TCOVQQGAQG CCTCLI IGI(j COVCAGCCGC ACTCACPGrC TCCTQCCCTC 
1151 TCrrcCAQQG AQGGCrCCCr GATACTCTAG CCTCACTCfiG CCTCCTO^CC 
1201 TCCm^CCT CCrcCAQQGA QGCCTCCTTC ATGITCCAGC CTCATTAACT 

1251 ccCTcrcAcr ccrcrxsoGrc caqcttccat gaotttcct GrrccrAGTxs 

1301 TQGAGCCrOC TCTCTTCaT TCTCCATGTX: AGCACCAQCC CCACCX3CCTC 
1351 CAQGOTCrA CTOVrTCAAC ACACTCGGrA CGGGGCACAG GGGGTtTGGA 
1401 CCrCACCCTT ACCCTCAGTC TACCTCCAAA CCCIGCTGIG AGCCTX5GAAA 
1451 ATATQGGAAG GCAQQGAATC CAOVQGACAA GTC3QGGAGAC TOGGGCTCAG 
1501 AGTCGQGAAG GAQCPQGrCT AQOGCCCCFG GTOGGTCAGC AGGCAQGACT 
1551 QGAACCCAGT COnQGCTCCT CAGTOQCGGG TQGACTCOVG CCAGCCCTCC 
1601 CPGGCTCACA TCTGTCAAAG CAAGGGGATC GGGAAGGAGC GGTAGAGCAG 
1651 GGQCTTCACC ATOaJTACrC TOGGTCTCCC TGAGACCCAT GTTCrCAGTT 
1701 GCTGTUnQQG TTCGGAQGAA GrTACCAQCA GACAGGAAGG ATQGAQQCTC 
1751 AGGAGnrrOVC TCACTTCar CrCCTTGAGAA CATCOVGAGT CCAGCGCAAG 
1801 CAQQQQGAAG GQCATCAGGT TQQGCATQGC OVQGQCTCrA O^AGCCTQGG 
1851 ACAGAGATQG GGGrCTCAGG CTGAGTCTCA COSTTCfiGVC GGGGGTCAGG 
1901 ATGTAGCCCA GQGTCATQGC TGAAQGrGAG GGCTOGGGGr CACCVCCCTG 
1951 AIGI I ICAQC CGCCACACAG TGAlCTTTGAG AACATGAGTC TCAQQQGATG 
2001 TCATGC CCCT GTTTCACOCC TCATTCCCCT CATTCCCATC CCCTTCCnT 
2051 IIIIIGAAAC GGAGTCrnGC TCavrO\0CC AQGCTXaGAGT GTAGTOGGGT 
2101 GATCTTCGCr CACTGOSACC TCCAOCTiCCC AAGTHCAOVC GATTCTCCrG 
2151 CCT Q^GCCT C CGGAGTAGAT GQGATTTOVG GTX3CACX5CCA COVTGCCTCG 
2201 CTAAI I I I IG TAI 1 1 I lAAT AGAGACAGAG TTmSCCATG TTAGCOVQGC 
2251 TAGTCrCGAA CTTCTXiACa" CAGGTGATCC ACCTGCCTCG GCCTCCCAAA 
2301 GTGCTGQGAT TACfifiGTGTG AGCCACCATG TQGQGCOCAT CXZC LI I G I I \ 
2351 TGACAGACGT CAATGAQGCA GQGCTOGCre GAGFGQGGAG CXXO«3QGAA 
2401 GTCrrcCTXiG AAQOVGTCAG AQQGATQQQG GTAGGAQGCT GAAACATCAA 
2451 GGAQGGCrCC CTCGAQGAQG CQQGrcGGTC TGAAGCATCA GCAAQGCTTC 
2501 TGAGTTACTA GTXmCTAGCT OVQCTTCCAG GAQGCAGrxn" CGGAGrGCTC 
2551 TCCrcrCAAG QGmSQGACr CATGACrCAC AGGGCIGCAT GCTXJrGCTGG 
2601 GGCTGAGCTG ACCOGGGCT CFQCCOGTC CAGTOCTGCT GGGCCTGOVG 
2651 GOTCreCOC TCTCTUrCCr GATTCCAGAA TATCAGATTC TCrCTGCTTC 
2701 CCTGreAAQC CAQCAQGCAG AAGPGACTQC CTCTGnACC GGCAGQGATA 
2751 CTGAGGCCTA GAQQGCTOGC ATGCGGCAGA ACOGATCTGA ATTCATTCAG 
2801 GTCATAGGGA CAGACTPGAG TTTQQGreiT GGCAATCCGG GTAGAQQGA^ 
2851 CAQCOVQQQC AAAQQCATQG AQGTOQGACC OVOVQOGCTC TQGCTACXnT 
2901 AlGCraGTAGC OVaXTGAOV CCCAQGAGTX; AAQGCTTCrC T5C L I IC I I I 
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2951 TCTGVQCTTC COVACAAQGC TAC3C5CAGAAG MCCCCCmS ACTGAAGCAA 
3001 TQGAQQQQC5G TCOVQCTCTC TGCTCCCAQG ATCCTCiGQGC AGAGCTXXnA 
3051 GAAC3QQGTQG CAGCCATCGA TGTCACTCAC TTCGAQGAQG CAGATQGTOG 
3101 CCCAGAQCCr ACTAGAAAOJ GTCTOGACXIC CCC^CCAC3QG GCOVGAGCTG 
3151 CCTCTCTW aXFQQOVOT ACTTCAAGAC TQCFCCOVQC COiGCCTAQC 
3201 CrCrCAGCCA QGAAQCTTTC CCTACAQGAG CX3QCCAGCAG GAAQCTATCr 
3251 QGAQGCGCAG GCTXSQGCCn" ATGCCAOSGG GCCTGCCAGC CAOVTCTCCC 
3301 CCCQQGCCre GGGGAQGCCC ACCATOSAGr CCCACCAGCT QGCCATCTCA 
3351 GATQCAGAQG TTOGTXSQQQC AGAACGAQQG GTTCTTCArc AGCCCCTCAG 
3401 TAGTCreCAA TGAAGACTCT TTCCTCGCCX: TCICIGIGCC ACACXSQCTAT 
3451 CTAGCTTTOG TTTGCATACC CTCAGAQCTG GQGAGATCAC TACCTAACAA 
3501 TATAGCTTCr TCCCAACCAG GQGAGCTCCA GCTGAGCCAA AGGCreCCTT 
3551 CCCTAAGTCC TGOATTCCC ACTCCCAGCC OVOGCOAQG AAATAGGTCT 
3601 CrOCCTQGrc CCCTATCTAG TLI ILI lAGA GfiiTGTGfiifiOK TAGATOCTAT 
3651 GraXCCTTC CCCGCTAACr QTCTCCAGC TTCXMXXrr CGCCTCTAAT 
3701 TCTTGCCrCTT AGAGTCTCCT GTGACTCAGA AQGQGCC3QQC CTCCCTCCAG 
3751 CCrCTGOGCr TCTGCTOGAG TTCrPGCCAT TTAQGTCFGA AAGTGAACTC 
3801 AQGTTCOWS CAGTXTACAG ATGTCAQQQG CrGAGCTTTC TCTQCCTGAA 

3851 COCAQQCrcr CAGGCrCTUr gcccagqqct cctcatotc tccttqgagt 
3901 CTAGACOTC TCATTCAQCr G L I I L I G GAA ATACTTCCTC ATGG b l ML I 
3951 CATGGATTAG GGTCTTCCAG ACTCCAGAAT CCAGACAQGA ATTAGCGTTT 
4001 TCCC TTCACC ACIGCTTCTG GQGAACAAQG CAO«3CCATG GGGTCACCAT 
4051 CCATCTTTTC AAAOVTCAGC CAGGrCrTCT GOTCACArAC GGGGGCGATC 
4101 GCAOCACjCAA OTCCCCATC OWVCrCAAA AQCmOGTGA GAGCTOGGGG 
4151 TGGQQGAATG AQGAGCTTAT GQCXAGAATT QGAOXTCAA CQOGCrcTCA 
4201 GGTAGGAGCA GTGCreCCTC GQGACCCAGC TCOVCCTGGT GCTOGCTCTT 
4251 CCCCCACAGG ACTGOOTQCA GCTCAACCAG TACAAQCTQC AGACTGAGAT 
4301 TQGCAAQGTA GGAGTQGGCA QGCOGAGAGC AGTOQQQGCT TOQQGATTCT 
4351 CrcTTPOQCG CTXaCrCCTTC TCTOGTCTOG GAGQGAAOSG GAGGO\GAGC 
4401 CAGQCAAGTC CTAGCCTOGA GGTCAQGACA GnTCGTCCC CTCTCGGAAG 
4451 TACCCAQGTA CCCAGGGGGA GCGTOGfiJ^ IGGCTCCTGA TTCCOGACTC 
4501 TCTGAGnCT TCACAGTOGA CAAQGAQQGA CrGAGGGAQG CATQGAQCCA 
4551 TCTCGAQGOV AQCAQQQQCA GTTAC3CAQQG GGCAQGAGTC CCCTGOOCAT 
4601 CreCTACAAT ATTTCCCOGT GAGCCAQCTX3 GTXSGTCQGrA GTGCAGATQG 
4651 GGTGCAQGAG AGACCAGAGC TQCTGQGCrC CCCACCTCCT GAGCTOGTCC 
4701 TCGGAQQQGr TQCCCTGnrCC AQGTOGQGCT GACTGATGCC TATCTGCAQG 
4751 GreCCTAOQG TCTGGrGAQG CrGGCCTACA AGGAAAGTCA AGACAGACAC 
4801 TATCTGAGTC TQQQGATAOG AQOGAGGTCT TQGCONAGCC AQQCCCTGGA 
4851 AGCCTGAQGG 6TX5GGGCAGG Abl IGIGCI I AQGAGATAGA GGACAGQGCT 
4901 GCCTCAGAGT GAGCTCCCTC TCCOAGGQG TATCCAAAGG AATGAGOTC 
4951 CTAACCCTQG GGATATGC^A GCAGAGACTG GATTCCTCTG AGQQGAAAGC 
5001 TCCAGAAAQG CI IGLIGGGG GAATAAGQQG AAQQQCTAQG CTCAGATATG 
5051 GCCACCOCCA ACCGGQCTTA ACAOTAOCT GQQCCACACC CTCAQQQCOA 
5101 GTAGCAGATG TCCAGTreTGC CTCTCGGGAC CTCAGTCCAC ATGTACCAGC 
5151 CrGTTCTAGC CCCTGGTGGC TGCACAGTAG TGACATTTCT GTCCCrCCTT 
5201 CCTTAGGCAA TGAAAGTCCT TTCCAAAAAG AAGnACTGA AGCAGTATQG 
5251 CnrCCACGr ATUrATCTTC TGATCCrcrC (XraOGAGCT CCTAQCCTOG 
5301 AQQCAGAGGA GGAGACCFCG ATCCTGAGCT AGTTTTGGCT AGGAATGGGG 
5351 TAGAGAGQGA GACAGGGTGA GCAGAGGCCT GGQGACAGAA TCTGCCCnGT 
5401 GQGTTCQGAC AAGACCAOsG GCATGCAAGA CTLI IGLI IG AGA CIUj I I I 
5451 GQGGGCCAOG GPGAGGCCCA GCCACCTQGA ACAG b l G I I I GAGTrCrCTT 
5501 CCTOGTCAOV QGTGQCCCrC CCCGGAGAGG GTCCOViGCT GCGCAGGGAG 
5551 GACCAGCCAA GCAQCTGCTC CCCCTQGAGC QQGTCrA<XA QGAGATTCCC 
5601 ATCCTGAAGA AQCTGGACCA CGTGAATCTG GTCAAACTGA TCGAGGTAGG 
5651 QQGTXSGTCGr GAGCAG6TGG GAACCAGCAC CreAGTCTCA TQGGAGCGQC 
5701 TTCTX3GTX3Cr GQQGAGCCCC TAGCACAGAC CCAGGGATCT TGCCCAQGTCi 
5751 GCAGATG1T3G CTCAGGCCTC TGAQGACAGG GCCAGACTTC GGGTOGGGCT 
5801 QCAQGAAQQC TTTCQQQGCC CfiCCCTGETC AQOGATCrTC CCAAOTTCCC 
5851 AIGGAGGGTC AGGGGCTGGC CCAGAGGCAA GAAGPSAGGC GCFCATTGCA 
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5901 GCTQGAGQQG AGGAAGGCTC GATCTCGTCT GGQGGGCCAG GTRiQQQOTC 
5951 GGTGACTTCr GAQGCCCCAT CAGTCTCQCA CCACCTGTAC ACTTCCTOCT 

6001 Tccrrcrcrc GOGTCGrrec atccatacta AGGbi iligg qgctogcaag 

6051 GACOVQGAQG CXTCQGACCr CXIAACOCCAC QCCTCCrCMK GCCCCACCCC 

6101 CATCrcrecr occtctcacc aqgtgctoga tga«xcagct gaqgacaaoc 

6151 TCTATTTCQG TGAGTGACCr GGCTCATTCC CACAGCAGCT OVCTCAQGGC 
6201 TCGCCG^AQG GCTCCCTraG GACATCTATG ACCTTCAQGr GGGCGGTUTA 
6251 AATGCACTGA CCTCCTGOGG AO\GAAGAAA AAOkCAOGTT CTGAAGCCCT 
6301 GGATTCOCTT GCCOVQCCCT GOVGAACOVG QCGCAGAATA TCCACTTAGA 
6351 TTCMONAAT ATQQCOVAQC CCCACTCCCT GCTTCCCrcr GAGO^CAAG 
6401 ACAGTGGATC CAOGreGGCT QOXQCTCfiG GTAGATQCAG GAAGCAQQCT 
6451 GCATQGGrrc COVGACACTG TAGCTCTCTG CCTCAGmT CCCACCTATA 
6501 AAACAQGGAT ACTAGTOGTG TCTACCTCAT AQQGTTCCTG TGAAGAGTAA 
6551 ATGAGTAATT ATATGTAAAG CACATTGGTT ATTATCCTTC TTAATAGTAA 
6601 TCTTATTATT TTAbl KjLI I G I GIC I U a l I CAGGGCTX^ OTAGAiOGAG 
6651 GCCrCAGAAA ATQQQGCAGA AGMGAACTG GGTAGGAAT TAGAQGCFGA 
6701 GGCnTAGTC TCOVCTCCCT ACCCTACCTG CCrcrCTQCr ATGACCTTTA 
6751 QGAAAATTTC TGCCCCTTCT CrerXSCOTCA GTTTCCCCCr CTCTAAAAGG 
6801 GCOCCATCCr GATQCTGATG GTTCTCAOCr QQCAGCTGAG GATOVGATGA 
6851 GACAGGTCCA TAGCAGACC3C CACTCTCATC OmTATTTC CTCTCATATC 
6901 CCAGGGTCCC CTUrCCTCrC CCTXSCCTOGA GrATCCCTGC ATGCCTOCCC 

6951 ccrcrccTAC cctccagaac aqqgaqqgac cttcgcatcg gctgctttcc 
7001 cagccagcta cacotacct Tcnurcnr TcnrcAGre tttgacctcc 

7051 TGAGAAAGQG GTCAGrrcCC CXTTCCTCATC AQGO^GGTO^ ATTCTCATCC 
7101 AQGCOTCCr TCCTTrcCCr CCCTCTCTCC COVQCCCAQG QGTCAGCTAC 
7151 TCTAGGAGAA GTCAGAGACG GAGGCCCTCC CCTTAGQGGT AAATAAGAGA 
7201 CCAAGAGGAC CATTCrTPGA AQQCTCATQG QGGrCAGTGA QGCTCAAATA 
7251 GTCAGGGAGA CCTOGGAAA AGGGGAGQGA TTTTCACCCA GGGCTTCAAG 
7301 AACTAGGAAG ATAQGGATQG AQGAGAQQGG GAAGAAAQGA GKj I I I II I A 
7351 GGTAA AAGTA TATAGAQGTX; GGACTCAACT aTACC3QGTA TTCAAATCAC 
7401 AAAGGGmr TCAQCTTTCC AAOVAGTCTG TGAATQGAGT GGGTX3GGATT 

7451 ccAGrrecrc covrrrcreA gaqggaaaqc taaqgacovg agaaqgtaog 

7501 TOGCTTCCrc AAQGTCACAC AGCAAGTCAC TGATQGAQCC CAQQCrPCOX 
7551 CATGTCreCC CTATTKjQGCr TTTCAGGGrA TTTACAGAGC AGATGACATC 
7601 GAGTAATGAG CAOQGGGCTC GGTOGTCOiG GACCCTCACT QCO^AGGGT 
7651 GAATGCAGCC TGCQGCTTGT CCCTTTGOCr GQGCGGCTCC CTACAGACCA 
7701 ATCFG GGGAG AQGQGGVQGG AGTOGrGrCC CTTTAAGACT TQGAQGCnT 
7751 CAAAKjIIII GACXTCTATC CAAAAOVVGA AATATATATT TCTATTQCTA 
7801 TCCATATCre TAACTGAAAC CAAAATTTTA CAAAGCAGCA TATATCTTTA 
7851 CTACATGCAA TATATTCTCA TATATTCTAC TTATTTAG6A AAAAAAAAAA 
7901 AAAGCAGrPG CCACCCACTA AATTGATTTC ATGATCCTCT arGGGTOG 
7951 GATCCAGGGT TTGAAACAGr QCTCTAAATG GCATCTTnOC AATTGATTAT 
8001 GGACAATTAA GTACTTAGAA GAAGGAATAT CAAQCCAATC AGAAATTAAG 
8051 AGAAAGCTCA TTTGAA ATTA TGATTCAAAT GGGATATCTA TGAGTATGTG 
8101 TGCTTTAAGr I 1 1 I lATTAT GTAGCAGAAA AAGCTAATAT CrPGAGTTGr 
8151 AGGGACrCAT GPOGGOlCAG GTTTCCOQGG ACGTCCCGAC CACCPGAATG 
8201 GCGGGGTIGGC CTCATTTCAG CTGAATGCCC CTGCOGQCAr CXTTCTOCAT 
8251 AGGCmnXZA TGGAAGTGCX: CTGFGACAAG aXTTCTXHGG AGGAQCAAGC 
8301 TC3QCCTCTAC CreC3QQGAC3G TOVTCCTOGG CCTCGAGTAC TGrGAGTCCG 
8351 GGGCAGCrrc CCCACTQQQG CTOGGGCTAG GGGATCTGGC AGGCGQCAGA 
8401 GCCCAQGCrc AGCAGACTCT GAGCAGCTCC CGTCAGTCAG AGCTGACCTC 
8451 O CAATQ^ jCT TO«nT3QGAG TQQGGCATQC AC3GTXnQG0G GGQCCAAAQG 
8501 OCmTTCTC QQGFQGQGCG GGOQGFQGAC TGCACreOQC ATGTCCCAGA 
8551 TCOTCGrOG TGICIGGTCC TGTQQGrCTG AGrCCreGCT Gl ICIGIATC 
8601 I I ICI ICTCC TGAbI ICI lA GCCTAGCTTA GCGrPGCCAC GGGGCTTCAA 
8651 GAGATCTX3QG AAGGAAGGGA TTTATGrCCA GCIGCI G GGG AGAGTCnGTC 
8701 CTQGCAR3QG QCaSQQGCAT GGFOGONGGG TQGATTTACC TGrGAQOGQC 
8751 OCTAGTCreA TAAGAGCTCA GGAGGGTW GTCAGGTOG CCrCTCTXTC 
8801 ATTTCATTCA TTAGCTACAT TCACTTQCXT QQQQQCArAG GGGTCAAAGA 
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8851 CCOVGACCC3G AGrPCACQQC CTAGTIQQGAG QGACAGGAAT CTAQGCAQGC 
8901 AGATMTACA GCGFSGTOCC TQCCAAGQCT GQQGAGCCTA GAQGCTCTAG 
8951 GAGTCCCQQG QQQCTQQQGA AGTCTCCCTC AAGAGGCTAC TTATGATTCG 
9001 GGTCCTCAQG GATGAGTAGA CrTCCTTXaCr CAG bl 1 1 I G A QQGATQQGCG 
9051 TCGAAGAGGA TGFQC3CTCQC ATAGGOGTOT ACTOGAGTC TQQQGAGAAG 
9101 TQGAGTCTXiG CTGAAGCCrC CAGTX3QGCAG AQGAGGGCO; TQGTTAGreA 
9151 AAGATGATQC TGGA<\ACACr GTCOGGGCCA CAQCATGAQG GCTCQGAATC 
9201 CCrCCCCTGA GGTCTTrQCr GACTGCATCC TGCOVGCTCT GTCAGGCCCT 
9251 GAGAGCTTTA AGCATQGGGA GGGGQGTGAT GGGAIIIGIG CCTGAGAAAG 

9301 crcrcrcroG ovqctctctg gtggctggat TGGAGrcrer catoqgaqgg 

9351 TGAGAQGCAG CCAQCTXX3CC AQGGAQGAQG CIGI I ILIGC AQCCO\AGTG 
9401 ACAGATQGFG AGGCCTQGAT TAAQGCAGTG GCAGCAQGAT QQQGATAQGA 
9451 AGGAQGTOQG GTGGTCAGCA TGGAGTCACT TQCCGGTCTC GQGAGAQGAG 
9501 AGCCCXTAGA GVCCTAGGGT COX3GGGTGG GTTQGGGACC AGGGGAGAIG 
9551 CXXZATCTCTA AAATCTTAQC TTOQQCCAQG C3QOVQQGGCT CATQCCTCTA 
9601 ATCCCAGCAC TTTOQGAQGC aJAGGTXSQGT AGATCACCTG AGGTCAQGGG 
9651 TTTGAGACCA GCCTGGCOVA OGTOGCAAAA GCCTCrCTCr ACTAOWOA 
9701 CAAAAATTAG CLI IGlblGG TCGTCGGCAC CTOTAATCCC AGCTACFCGG 
9751 GAQGCTGAQG 0«3GAGAATC GCTPGAACCr QQGAQGTX3GA GGTreCAGTG 
9801 AGCGGAGATC ACGCCATTCC ACTCCAGCTT QQGrGACAAG AGTGAAACTC 
9851 CATCrCAAAA TAAATAAATA AATAAATQCA TACATACATA TATACATACA 
9901 TACATAAAAA TAAAAAATAA AATCTTAQCT TGbl I ICI IG GGAGCATATT 
9951 CTTTCCCreG GQGAACAQQG TOGGGATCTG GCnSAQGTTr GACCTQOVGT 
KXWl GAOVGAAAOV GGACIGILI I TATCCTCCTC GAQCCTCTCC TTTCCCTTCA 
10051 GATTAAGACr CTCnTQCAC ATATCQQGAA ACTGAQGCAC ACAGAQQQGA 
10101 GGGC mGOk GAAAATCCCr ACCAAGQGCC TAGAGGCATG GGATX5QGAAG 
10151 GGGACATTTT ACCCCQGrAC GGrO«JTX3GC AQQCACAGTC CRHACONGC 
10201 TFXSQCnXAC CTCLI I ICIG TTCTAGnnOOC TTCrTTCOZC TGAAGFCCrG 
10251 TTGrCTGCTA TCCCCTAGCC TCCACAAAGA AACEAGTrTA TCrTACCTGG 
10301 TTCrTOGGTA AAQCCTCATC AGGACCCAQC TAATCACAGT GAAGQGCTTC 
10351 CCTCGQGCAG AAGQGrTAGC GCOVOGGQCT QGACAQGreG ATGMCAGAG 
104O1 GCAC3GAGGGC GCTCAAGACT TQCCTTCTGA TTCrOGCOX AAGAAGAGAG 
10451 AGrrcAQQCr GGCATGAGAG GQCTCQGRSG TCAQQQCQQC OCAQQCCTOG 
10501 TTCTCAGrnG ATQQQQQCAG GTX3CAAGGAT GCAGATGATS AGAAGOVGTT 
10551 GGATCTGGAA TAGATGTGAG AAGCTGAQCT CACfiGMXTT QCTGATGAGC 
10601 A GGATG TGGG GTCTCAGAQG AQGAATPGAG GATGATCCTC AAGmTTCG 
10651 CCTT TCACAG AATGG AAAAG AATQGQGAQC AGCAGGGGGG M I IGI 1 1 IG 
10701 CI I IGM I IG AIIIIGIIGG TQGTAQGCAT TQCAQQCAGA GAAATCAAGT 
10751 TCTGAATTAG ACATCTTATT GCACrGTCTT CAGATATACA GAGACATATA 
10801 TCGATGCCTA GCTCCCTAGr TATCTACCAA GATCTCTATT QGAAATCTAT 
10851 GTGQGTAAAG AGCTQGAGIT CAAGGGAGAG GCTAGGGITT GAGATAAGAA 
10901 CATGAGACCA CTTrCCATQG TCAAATGrCC ACGCCCCTGA GLMCIGIGC 
10951 CJCTGAAQQGT GTCro^GATT CCI IGIGIGI QOCTOGCACA TAGTAGGOVA 
11001 TCAAGAAAGT GCOXCTQGTT TTATCGnAT TGTTATACQG CACCCGCCTT 
11051 CrCTXSCCGQC AGCCrCCCTC TCCrOTCTC CCTTCCrCTT TOTCrCTCG 

uioi carcrcTCC tccctcctct ccagcatcct qqggtccgtt ggtccagatg 

11151 AAGGTACrrc CTAAQGAQQG AGCCCACAQG TCGATQGTOG GQQGATQQQG 
11201 TCAGraQOGT CATTCTCTXT CTTQQCrQQG ACOTACOVG TCATOTCAGC 
11251 TTGAGCCACC TGTOVCTTOG TQGTCGTQCr GGGCCCAGAA AGCAGGGCAG 
11301 ACCrCCAGCC TATTAQGTCA I I ICIGATTT GGGATTCGTC CTACTATATG 
11351 TOGCTGACCr TACACCCCAG CTUTXTTOVTC CIGLI IGTCC CAAGGCCTQG 
11401 QGTX3CCATCC ATCTCrCFGA AACCCCATCA GCC3CAGATCC CGAQQGCTGA 
11451 GATOGTACCr CTGTAQGATA GCAGACTOCC TACMTGTA CTCTO^GTCC 
11501 CAG CAGCAG G GAOVrCTTTG CCTAGCCTCG GTCGGQGATC GAACTCGAGA 
11551 AAQGTTTTCA TraOCnTQG GCCTGCAGAC GGCACTCACA QGGAAQQQGC 
11601 AGAGCTAGCC TAQGAAGAAC TCTGCTCCCA GCrGGGGGGG GTCGCTCAGG 
11651 OCreTAATCX: O^GCACITTC GGAQQCC3GAG GTOGGreGAT CACCTCAOGT 
11701 CAQGAGmCA AGAOCAGCCT GACCAACATG GGGAAACCCT GTCVCTMIVA 
11751 AAAATAONAA AAGTAQCOiG GCXnTSGTOGC AGACACXHOT AATCOOVACT 
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11801 ACraSGGAQG CTX5AGGCAGG AGAATCTCTT GAACCTCQGA GGTCGAQGCT 
1L851 GCAGTCAGCC GAGATCAC3GC CATTQCACTC CAGCCTX3QGG GACAGAGTCA 

11901 GAcrcrcrcr caaaaaaaaa aaaaaaaaac caaaaaaaaa aacagcaaca 

11951 ACrCrCCTQC CCTAGnTCC TCPGACCrCC CCACrCAQCA GCAGATCCCr 
12001 TGTTTGTOVr QGAGAGGOTG CTOGACmOG AGTCOVAAGA CTCCTAAGAT 
12051 TCCAGTCCTC GCTCTOCTCC TCACAGCCTG GGCTCAGTCT CTGCACCTGC 
12101 GPQGAGCAGA TCGGCCTCAC GTCCTCCTCC CAGCTCGTCA CCAGACjGAAA 
12151 GTCTTGOVraG GCTQQGATCT CCCQGCCXSGC OTCCCTCGCT GTX3CAAGGAC 
12201 QQGrcrOQQG TGCraOCCAG GQGTCCCQVG QOQVQCXSCrC AGCTOWjCT 
12251 COCCI ICILI QCASreCACr GCOVGAAGAT CGTCCACAQG GACATCAAGC 
12301 CATCCfiACCr GCrCCreOQG GATGATQQQC ACGTCAAGAT CGCGGACTTT 
12351 GGOJrCAGOV ACOVCTTTGA QGGGAAOGAC GCTCAGCTCr CGVGCACQGC 
12401 GQGAACCCOV GCATTCATQG CCCCCGAQGC C A I I I CICa AT TCCQGCCftGA 
12451 GCTTCAGTOG GMSGTGMZr GGQVGGCGCT GGGCO^GGCT GGGCTTONAG 
12501 TQQQQQQOGT AATAQCTTQC CQCAGTOQCC CA b I I I C I A A CCTCAQOGTC 
12551 CCAQQOTCTT TCTCrcrAQG GACTGACATA TTTQCCrCTT CCmOGAGCC 
12601 TGACAAACrc OVCAAOTTC GCCTTCrCCr GmTCCAGC AAAGTOGTCC 
12651 CAAATCrCCC TPQOVGATAT TTACIGI IGG TTTXTCPGTG CTGCSGTTCrG 
12701 GACGGGACrc TQGAAGAGQC AGAAACAAAG AGAACCCrGT TTOCrcCOCT 
12751 CnSGATQOTT TOQQGQGAAG TTCGQQOTCC ODQCAGATCr TOGGACATQG 
12801 CAGGATTTGA ACTQQCCCTT GAAQVATQQG GAQGATCTCA GCAGGACCTG 
12851 GAGCCTAGAG AATAAACCAG AGAACAGAAG GGCTCAGGCT GQQQQQCAGA 
12901 GQGTATAAAG GGCCTQGAAG TTTQGGCnT CTCCTAACTG ACAQGAQCCT 
12951 fiCOCMfiCn GTCTGAAONA GAGOTTACAC QGrCTOQGQC AGTTCCCnQG 
13001 QCACATQQCr GnTCAOCTA TGGAGTGCCA GCCACCCCAC TGCOVQQGAG 
13051 GCnnXSGGPG AGAQGCATTT GGACACGTGT GAGTATCCAG GAAAGAGGTC 
13101 AGGAQGCCQG GCACAGTOGC TCATQCCTCT AATCCCAGTG CTTTOGGAQG 
13151 CCAAQGTGGA ILICI lAAGG CTAQGAATTT GAGATGAGGC TQQGCA/VCAr 
13201 AGCAAGACOC CATTTCTACA AAAAAAAAAA TAAAAACATT AGACAGGTTGT 
13251 GGTAGreCAC ACCTOTAGTC CCAGCTACTT GQGAQQCCGA GGTGQGAQGA 
13301 TGGCTTCAGT CCAQGAGTTG GGGGCTUTAG TGAQCrcPGA TOGTCTCTAG 
13351 CCreAGTTGAC TCAQOGACAC CTTCTCrOGA AGAAAGAAAG AAAGAGGrPG 
13401 GQGATGTTGA TAAAGATnT TTCAAATOTT TTATTTTCAT ATAATTCTAA 
13451 ATTTACAGAA AAGFTOGAAG AATAGTACAA AGAAATCCCC TATATCmT 
13501 TACCC AGATT CACQVA TTAT TGAC A I I I IG TCCCACTGGC I I I I lO VTCA 
13551 TCI I ILI I 1 1 1 1 I I IGAGCC GGAGTCTOGC TCCTCTGGCC 0\GGCTCGAG 
13601 TQCAGTGQOG OGATCTXZAGC TCACTGCAAG CKXAOCTCC TQQGTTCAOG 
13651 CCATTCrCCr QGCTCAAOCr CCGGAGTAGC TQQGACTACA GGGGCCCACC 
13701 ACCAOQCCGG GCTAATTTTT TGTAI Hill AGTAGAGAGG GGGTTTCACC 
13751 GTXJrrAGCCA GGATCGTCrc GATCTCCTGA CCTGGTGATC CGCCCGCCTC 
13801 GGCCrCCCAA AGreCTGGGA TTACAGGTCT GAGCCACCAC GCCCAGCCAG 
13851 AAATTTATCA TTGATAAGAC TTATATATOG GTCAQGOVTC GTQGCTCATG 
13901 CCTCTAATTC OVQOGCnTG QGAQGOCAAG GTAOGTOGAT CACCTCAQGr 
13951 CAQGAGTTTC AGACCAGCCT GGCCAACGTG GTCAAACCCC GTCrCrACTA 
14001 AAAAATACAA AAATTAGCCG GGCATQGTGG CQQQCACCTG TAATTCCAGC 
14051 TACrnOGGAG GCGGAQGCAG GTGGATCACC TGAQGTCAGG AGTTTGAGAC 

14101 CAQccraoci: AAamoGrrcA aaccccgtct ctactaaaaa atacaaaaat 

14151 TAGOGQQQCA TQGTCQCAGG CACCTCTAAT TGCAQCTACT TQQGAQQCre 
14201 AGGCAGAAGA ATOGITOGAA CCCAGGAGGC agaqgttgca gtxjagctaag 
14251 ATCGTGCTAT TGCACTCTAG CCTGGGCGAC AGAGTCAGAC TCTGTC7GAA 
14301 AAAAAAAAGA CATACATAAT CCACAGACCT TATTTAAATG TTATCAGTrG 
14351 TOCreATACT GTACTTCATA ALI ICI ICI I 1 1 ICIGGTCC CQGAATCCAA 
14401 TOGAQGACXA CTTCCTQCAT TCACCI ICI I GTCTCTCCTA TCCTmCATC 
14451 TGGAAGAGGG CCTTCGCCre CCbl IGICI I TCCrGATCTT GACATTTrOG 
14501 AAGACAACCA GCCTCnATT TTGrAGAATG TTCTCAGTTT GCATTTCTCT 
14551 GGTCTTCCCr GGmSQGATT CAGATCATGC ATCTX3GGGCA GGAATATCTA 
14601 QGTAGAGATC GAGAATCACT CATATAAQQG AGAAAGTGGA TAGCAGAAGA 
14651 GGTGGGGrrc GGGAGCAGAA GGTAGAGAGA GCACAGGCFG GAGTCOVQGG 
14701 CXiGGGGGAGG CGCAGGGGTS TTTQGGAGCC CAGAGGAGTT GnGCAGTOG 
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14751 C3QGFQGATC<^ GOGOGrGAGA QGAOVQGGCC TCTCTGrGGG CAGGGGCTCT 
14801 TPQCAATATC AQGAAGAAQG TQGATTATCA GGAGAAGGGA TGACrCOTG 
14851 AAGCCGGAGC TQGnTAGTG AGO«5AAGTT CCATATATAC CATCATTCCT 
14901 GGQGTGGGrc TGIGGCAaSG GAGOSGCCCG TGTGACECTC TQGATGAAQG 
14951 AGbllllbIA CXTCTTGAGr TCGAAAOnA GLIbbI lAGA G I LIMC XXA 
15001 AQGAAACCCA GAACCCCTQG AQQGTTiGAQG CCTTCTTCTG GCCGCCCCre 
15051 TGTCCTCAQC ACTCAGCACXi QGGCCCAQOV TGQQGCAAGr ACCGCGGAGT 
15101 GTTTGTOSAG TGAGTACATG ACAGAQGAAA GAGGTTCCCr GCAGGCCTCT 
15151 CCTGCAGCGC GCTOGAGCTG GGTX3GQCAGA GGTOGCTCTG C LIGI Ki GGG 
15201 ACTGATCTGA GCATCnTCT TTCOVOGGCT TQGATGTATC GGGOVCTGGC 
15251 GrOVCjGTTCT ALIGLI I IGI CTATQGGAAG GTGAG1X3CCA QQGATQCCAG 
15301 CAGA GCTGGG GCQOGTCCAG ■TCAQGaSGQC AGQQGCGACG GATGCAQQCT 
15351 CTTCCnTTT GTCCTTAAGr GGLI 1 1 IGAA AGAGCCCACC TQGCTCAGAG 
15401 AAQGCTGAGA GAGAAGAQGC TnTTCTATC TTTCTCnQGr CCCCraOQGA 
15451 QGGATTCraG CGAAQGAGTC GCAQGAOVQC AGAOVCCTAA QQQGAQGTIQC 
15501 CGACGATQGT 6TTCCCACCG CCCCfiiXCfiG AGTGCTCCCC GTCCCTCrcr 
15551 CCCnGAGQC CATnCAOTA TTGAGCCATG TOTTCACTCC CTTGCrCATT 
15601 TATTCGAOW ATTCTCCTTC ACCCCTACCC TOGCTGAQGC TQGACCCTCG 
15651 QGAOVOCCAA C3GCTGAGGrA TGQGTGATCC CTGOIOGOVG GTUTOCCTGC 
15701 TCTOGTCACX: ACACTAAGGG GCAGGGGGGA ATTTCTVGTCA ACATCTPCOC 
15751 AAGCCCCAQG CCCTGOGAGT GGAQGCCTOG CCACAGGTGG CDGGTAATOGT 
15801 GGTX5GGTCCA CCOVQCCTCG CCTGGCmQG CCGOiQGTXSG CAGTAAOQGC 
15851 GGTGGATGCA CCOVGCCTCA TTCTTCCCTC AGOVACTCAT TCATTCAGTC 
15901 AACAI I IGI I GAACATTTAC AGTCTGAGrT GAGGTCXTTC TCATCTAATC 
15951 QGfiOO0CAG^ CXmOCCXZCCT ACCCrrcCCC COVCOVAQQG AQQQQQGmG 
16001 ATCCCCTGGC ACAQGTCGAG GCCCTX3GACC CACATCCTTT GTCTCCCrCT 
16051 CCACCCCACA GreCCCGTTC ATGGACGATT TCATCCTGGC CCTCCACAGG 
16101 AAGATCA^VGA ATGAQCCClGr GGTUTTTCCT GAQGQGTGAG TTCTCCACCC 
16151 AQQQGAACM QQQQQCTACX: ACCGGCTCCT QGTCTCreAG TTTTAGCAGA 
16201 QCnTTCOX TCTXyVQGAOC CCACCCCAGC CTX5CAGATAT GAAGGTOGCG 
16251 GTXjLIGI ICC CTGQGAQQGA CCCCTGAATA GATCGAGQQG AGGGACTCTG 
16301 GAGCCAiVQQG TCTCCGCAAC GTCACreTUT QGATQQGAAC CCTXyVGATCC 
16351 AQGGTTQGCC AQQGATGACC ACAQQCATDV TTOVOMICAC TCCTTCACC3G 
16401 CAGGCCTGCC TOGGGTCAGr GQCGCCAQCC CCACCCAGCC CCTXSGACTCA 
16451 AGGGGAAOT CTCCTTCCCC CACrCAGGGT CAQGGAACTT CAAGATQCCA 
16501 GTGGGPGCrC CCCATTTCAC AGATCGAAAA GAQGATQCTC TQGAGGAGAG 
16551 COGTOVOGGG GCTX5QGACTC AAGCCACTCT TCCTCCCCAC TCTTOCCATT 
16601 GTGA0GGW3G TCICIGAGCG TAGCAQQGAT GTOQQQGAQG CCTCmQCTX: 
16651 ATCCATQGTT CGCCTCATGA CQGCCACOGT GGOVGCCACA GCCTGAQCTC 
16701 CCAQGCrCCr CTTTTCAGCA GTXSGATTTCA GGAGTGAAAT QGAQGCaSQG 
16751 TGGQGTGGCr CACGCCTCrA ATCCCAGO^C TTTGGGAQGC TGAGOTQQGC 
16801 AGATOVCCTG AGnAQGAGT TAGAGACCAG CCTOGCCAAC ATQGTGAMC 
16851 CCCATCTCTA CTAAAAATAC AAAAATTAGC CAGGCGTGGT GGGGCACATC 
16901 TCTAGTCCCA GCTACrCGGG AGGCTX5AAQC AGGAGAATTG CTTCAACCCA 
16951 QGAQGCAGAG GnGCAGTGA GCCTGGGOGA CAGAGCAAGA CTCTCTCrCA 
17001 AAAAAAAAAA ACAGAAGAAA GAAACTGAAT AAGGCCGQGT GCGGTXjQCrC 
17051 ACACCTGTAA TTCCAQCACT TreGGAQQCC AAQGAQQG06 GATCAGGAQG 
17101 TO«3GAGATC GAGAOCATCC TQQCTAAOC QGTGAAAOCC CATCTCrACr 
17151 AAAAATAGAA AAAAAATTAG CCQQGGGTGG TQGCGQQGGC CTCTAGTCCC 
17201 AGCTACTCGA GAGGCPGAGG CAQGAGAAGG GOGFGAACCC QQ6AGGCAGA 
17251 GGTTTCAGTC AGCTGAGACC GTGCTACnSC ACTCCAGCCT GQGCGACAGA 
17301 GGGAGACrcr GTCTiDVAAAA AAAAAAAAAA AAAAAAAAAC AAAAAAAAA A 
17351 AACAAAAAAC AACAAACAAA AAAAGAAAAT GAAAOGGGAC TnGTACTCAG 
17401 CGACTCCTX3C TCTLI ILIGC TTATTTCCTC TGTGGTCCCC AAGCCCrQCT 
17451 GAGCCCrCCr CTTCCCTGTC TCTCQGCCTT GTTCCCACTT ATACCCCTPS 
17501 CCrCATTCAG GCCTCAGGCC CCTCCCCAGA CnATCTAGC CACCTTCCCC 
17551 CTGGrcrCQC TCCTGCTGGC CTCCCTGCAG TCCAQCCAAC ACATTCAGGC 
17601 G GGGAC AGOC CTGATAAAGC ACAAOSAATC TGCCTQCATC TCmOCCTGA 
17651 AKal I IGICIG AAGCrrcrCA AAGOONCACC CraQC3QCrAG CATTCAOM3G 
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17701 TcrcosGGrT creccAcccG crcCTCFQGG GcasccTCAC Tcccrrrccc 

17751 GAGOVCCAGC CAGCreGCTT CTCTCCATTr CCTCCTCATC CTCTOGrreC 
17801 CTTCCCrcCC TGCCrCCACA GrrcTACCCC TCGTCCCrCr OTCCreCTA 
17851 TACCCCCreC TGAQQQGrrcr CTTTCCXICTX: AGGCCAQGAA TTTTAAAAQG 
17901 GATGAAGCAT CTAAGACAAC AGGGGGAACC GAAGTCAACA GTCCTCAGAG 
17951 TOGCTTTCTC CTCCCTACrC TTQGAAQGM- QQQCTCCCCA AGACCACTQG 
18001 TQGO\AAGAA ACCTGQQGTT TQQCOQQGGG TQGTCQCTGX CGCCTCTAAT 
18051 CCCAGCACTT TQGGAGGCCA AQGCAGGGQG ATCATGAGAT CAQGAGATCG 
181D1 AGATCATCCr GGCTAACACIG GTCAAAOTC GrCTCTACTA AAAATACMA 
18151 AAATTAQCQG CGCAOSGTCG CQOGCACCTC TAGTCCCAQC TACrOGGGAG 
18201 GCTCAQGCAG GAGAATCGCA TGAACCTCGG AGGCQGAGCT TGOVGTCAGC 
18251 CXSAGATTCGG CCACFGCACr CCAGCCTOGG CCACAGAGCG AGACTCCATC 
18301 TCAAAAAAAA AAAAGAACCC TCQQGTTTCG GCAGAGAGAG TTCGAGCTGA 
18351 TGreOGQCre AQGGGGCreC TCCCTCOCAT CTGAGFCrCC CATCTCreCC 
18401 TQCACTOTC TQGCTOGCAC TGTQCOVQCC TQCTAACCTC CCTXMQCXTC 

18451 AGnrccrcc tctctcaaat gagagaggat crrcTcraoG tgtagaaaag 

18501 GAOSAQGTCG TGAGTXiGGrC TGAAQQCCTC TOGTCTCCCA TAAAQOGACT 
18551 CTCCTCACCA TCrTTCCCAC CCATTCGGGT GTCCAQCACC CATGGAACTC 
18601 TCTCTGIGCC TCrcrOCraG AGGGAGACTT GACCTCCTGC TOVQGAAAQG 

18651 crcrccAAQC ccrrcrnGTC aaattcctgc c tcct gt c xs gaactcagtc 

18701 TTCCCATCCX3 AQ9SACGfifiG GnTC3GGGAA GAGAGGTGGA CAGGAAQQGG 
18751 TCCrCATCAG CGGTCCCACC CTCCrCTCCT TCCTTCGCCC TCTCOVQGCC 
18801 AGAMTCAQC GAQGAGCTOV AGGACCTGAT CCTCAAGATG TTAGACAAGA 
18851 ATCCOGAGAC GAGAATPQGG GTGCCAGACA TCAAGGTGGG GGAACTQGGG 
18901 GTCmOGGCr GGGCTX3GGAC ACAGAAAACA GGAGTCACTT TCCCTTTCTG 
18951 GAGGGATCAA CACCAGGATG CAIGIdlGI I GGGTTTCAGr CrGrOGACTT 
19001 TQGACCCCTC CAQGTCATTC TOGTAATOQC CTGACCTCTC CCCCTCTCCC 
19051 TGCCCTCCC3G GCCCXXiACAG TTQCACCXTT GGGTGAGCAA GAAGGGGGAG 
19101 GAGCCCCrrc OTGOGAGGA QGAGCACTGC AGOGTXjGTGG AQGreACAGA 
19151 GGAQGAGGnr AAGAACTCAG TCAQGCTCAT CCCOVGCTOG ACCACQGTX3G 
19201 TAAGAGAGCC GGGGTAGATG CTCCCrTCrC CTGGAQGGCC TOGGGGACCT 
19251 GAGCOTGCr CrcreCXnXaG CTCCTTCOGG QGACAGAQGC CTCCCTGGCC 
19301 AGCCAGCrcr GATCCTX3QGC CACTCGfiOOC QCCATTCreC TQGAQGOOCA 
19351 T3GAGAGGGA GGTCrPGTGG TOSGGAGACC AGGAGGOTG GTCAQGAGAG 
19401 TGACPGATTT AAAGAAATAG 0GGGGGTX3QG GCCX3QGCGCG GTQGCTCACG 
19451 CCTUTAATCC CAGCAOTPG GGAGQCCAAG GCGGGCAGAT CACGAGGTCA 
19501 GGAGATC3GAG AOCATCCTTC AAACCCC3GAC TCTACTAAAA ATATAGAAAA 
19551 TTAQCraOQC GTOGTOQGQG GC3GC3GTCnrAG TOXAGCTAC TOGGGAQQCT 
19601 GAGGCAGGAG AATGGTGTTGA ACCOSGGAGG TCGAGnTGC OGTGAQCCGA 
19651 GATCGCGCCA CTGCACTCCA GCCTCQGCCA CAGAGCGAGA CTXXIGrcrCA 
19701 AAAAAAAAAA AAGAAGAAAA 6AAAAGAAAG AAATACCGGG CGCQGreGCT 
19751 CAOsCCraGA ATCCCAGCAC TTTGGGAQGC CGAGGGGGGT QGATCAC3GAG 
19801 GTXIAGGAGAT GGAGACOXTC CTOGCTAATA GQGGGAAACC CCACCTCTAC 
19851 TAAAAATACA AAAAAATTAG CCQQGCECAG TQGTX5GGCAC CTTJTAGTCCC 
19901 AGCTACTTSGG GAGGCCGAQG CAGGAGAATC GCTTGAACCT GGGAQGTGGA 

19951 GGrrcrAGre agccaagatc acgccattgc actccagcct ggttgacaga 

20001 ACGAGACrCX: ATCrCAAAAA AAAAAAGAAA GAAATAGATG GCCCTTCCTC 
20051 AQCX5GOVQCA GTCACCGTGA CTCGAAGAAG OVTTTGATTC GGTCXAGACA 
20101 GTTACTGAGC TTCCSGTTCTC CAGQCACTGC ACAAGGTCCC GAGGACAAQG 
20151 CAGGGGAAC3G GCCHSGGCAG CCnTQGATT GGAGGAGTGG CCCCAAAGCC 
20201 CACGTATCAG TTAGGCGGGG CCTQCGTCTC CCCCAGAGCC CAOGTATCAG 
20251 TTAQGCAGO\ CaXSCGTCTC CCXXAGAQCC CACATATCAG TTAGAOiQGG 
20301 CLIGLI ICIC CCCOVGGQCC CAGGTATOVG TTAGACQGGG OLI O LI IC I C 
20351 CCCCAGAGCC CAOGTATCAG TTAGACQGGG CCIGCI ICTC CCCCAGATCC 
20401 TGTCTATCAG TTAGACTGCG CCTGCTTCTC CCCCAGAGCC CACGTATCAG 
20451 TTAGACGGGG CCTGnACTC CCCCAGAGCC CAOGTATCAG TTAGAOGGO; 
20501 CTTCOTCrC COOIAGATOC OGGGTATCAG TTAGAOQQGC CnOOGTCTCC 
20551 COCAGATOOC GOGTATCAGT TAGACQQQGC TQQGTCTCCC (XAGAGCCCA 
20601 CXJTATCAGrr AGAGGGGGCr GOGTCTIGOCX: CAGAGOXAC GTATCAGITA 
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20651 GAC3QGCX3CCr GCmCTCCCC CAGAQCCCAC GTATOVCTTA GACX3QG(XTC 
20701 GCTCTCOXC AGAQCDCACX; TATCAGTTAG AOOGOQCCTG OTCTCCCCC 
20751 AGAGCCCGOG TATO^GHAG AClQGTGCCre CATCTCCCCC GTCCCCACGT 
20801 ATCAGTTAGA CQQOGCCTCC TTCTCCCCXA GAQCCCAC3OT ATCAGTTAGA 
20851 GQQGCCreCE TCTCCCCCAG ATCCnGGCTA TCCATTAGAC ACTSXTCre 
20901 TCrCCCCTAG TCCCGGCrCA CATTTCQGTT TTGCTCCTCr TCCTCTQCrC 
20951 AQCTTCTCTG TTOQCACrrc GAAAGTGATT CAO\TAGTCC CCOGTOQCCA 
21001 CCTQQQGCCA CTCAGAC5CCC TQCCCTQCCC CTQCCTCACA OTCAAGrPGAG 
21051 TCAGGGCAAG CACAAQGCCA GGAGGAGAGC CAQGGCCACT GCC3GrnGGGG 
2n01 GQGCCroOGC TTCCACTTTA TCCXXCTCre CAQQGTOCCG QCXDOVQCTOG 
21151 GACCAGCTOG CTCAATCCCT GCCCCCTATG CrTACTTCAC TCrGTCQQGT 
21201 GQCnSGAACC AGGCAACTCC CACXaGGGTCC CCATGACCAC TTGCCTCATC 
21251 TTAQCCACCA TCrCCTCTCr CTCAGACCAC TQGAACAACC TCCCACQCTG 

21301 TcccrrcoT crAcrcrcAC Tcccrcrocc coggtcaat ocrcAAcrcA 

21351 QCAOCO^GCA TQGTCOIAGr QQCATGAGTG TGTCACICTCC OVQCTOVGAG 
21401 CCIGCI ICIC ACrOQQQCTG CTGrGTCCCT CAGAATCAGA CCTCCAQCCT 
21451 GTOCCCCACC ACCOSCCCTC TTTTTCTGCG GQQCTGGTCC ACCGTCCXXSC 
21501 CATCATGCAC TaJKTCTGG COVCGTQCCA TQGAAGQQGC TCCCCCAGAG 

21551 ccTTCAGAcr TGoaTCCCT ctxjCcgqqqg agttxoacxx: CCGATCQCCA 

21601 OQQGACrOQC TCCCTOVCTT CXTTCQQCrT TTTAOQCCAG QGTXXXrrCC 
21651 TAGAGAGAAG CGAQCCTTCC CIGACCCTGr AGGTCAQCC TCCCCTCCTT 
21701 CACACCrCAT CGCCATTCCC I IGI I I lATT TTTTCCnTC CACTTACTGA 
21751 CATACATAAT TrACTGAnT TTLI ILI I lA CTTATCGCCr GTCTCCCCCA 
21801 ACTAGAATAT AAQCTCTATG ATOGCrOOGC GCAGTGGCTC AOKXnXjrAA 
21851 TCCO«5CACr TTCQGAGQCC AAQQCX3QGAG GATCACTTGA GGTCAGGAGT 
21901 TTGAGACCAG CCIGGCOVAC ATQCTGAAQC CCCGrCTCTT CTAAAAATAC 
21951 AAAAAATTAG CGQQGTTGTGG TQGTX5GA0QC CTGTAATCCC AGCrATTO\G 
22001 GAGACTGAGG GGGAAGGATC ATTTGAAGCDG GGGAGGCAGA GG1TGCAGTG 
22051 AGGOGAGATT GTGGCACreC ACTCOVQGCX: TQQGCAACAA GAGCAAAACT 
22101 CCGiCIGAAA AAAAAAAAGG CTATATGAGG GCAGGAATTC TOGCCTCAGT 
22151 GTX3GCCCCAG GGCCTAGAGT AGTX3GCCAGC ACCCAGTAGG CAGCCAGTX5G 
22201 TGACCAGTGT TGAGQQGATG GATQGACACA AGGGAQQGAG TCAAGGGACT 
22251 GGCAAGTCTG CCQCTGCCTC TCTGCATQQG TGTCAGrnCQG OGTCT O Gre 
22301 GGCACQGCAT GGAACCGTCC TTGrCAOSGA GGAGQGACAA AGGCAGAGAG 
22351 CCAQGCTGOG GCAGCTXJTTC CCCTCOGGC AGCCCCACTG ACTGOGCCAC 
22401 CQGCTGCQQC TG<VQCGGCTT CCGGGGCGQC CCTGCAGTAG CATCnGQCA 
22451 TCTTCrGGGC GGCCX3GAAGG CQGGAAQGAT GGCACAGOVT CCXTCCATCG 
22501 OATTCCrcCC GTAQCJGAGAA QGrATGTCT AATGGACTCX: CACTTCCAQC 
22551 CCTGGCCCrc CCOACTCTTT CAGCCTX5GCC TTGGQ6ACCC TTCATQQGCT 
22601 QGrCCCQQCC CCCTCCTCAT GTACCAGTGG CATCCGGCTC CTCACCATTC 
22651 CAQGAATATG CCCCCAGCTG CCAGCGCCCC GTGTTCnGC CTCTGCOATT 
22701 TCATGCTGTG CTGATTGAGA TQQGACCCGC AOGOGGCCC OGTOGOVGC 
22751 TGCTCTXDQQG GAATOQGAGC AGAGGCTGC3G TCTCTOGGAG CCTGOGACXT 
22801 GTCCrCCrCA CGCreCCTTG TCCTCCrCAG ATCCTQGTGA AGTCOMGCT 
22851 GAQGAAGGGT TCCnTQQGA ACCCGTTTCA GCCCCAAGCA OQGAGQGAAG 
22901 AGGGATCCAT GTCTGCTCCA GGAAACCTAC TOGrGTAAGT ACTGGTCQGC 
22951 CAQQGACTGC CGGQOACTTCC CTXaGAGmOG GTGGGGAGGT CTGAGGCCOV 

23001 TCcrcccAcr crcAcrcrrcG ttcgqccaag gcovgagcxt QOOGAcrnoG 

23051 CCAQGTCrOG GIGI IGGCCC CATTTGGATC TCTGTCCCG^ AGGTTAGTCG 
23101 GGGCTAGAAG GGACCI 1 1 IG GGCCCAGCTC I IGCI ICATT CCTGGQQCCA 
23151 GCATCCCTCA CACACACACT TCCAGGGATG AQGAGCTCAC GCAGCCCCTC 
23201 CATQGGACAG GAAGACCCTT OTCCATGCA GCTTGATGTC ACTCrCTCAC 
23251 TGQGTCOVQC aXTCTCQQG CmCAAATCT GTGGCCCCCT CAGCXDCTPOG 
23301 CAGCCTGQC^ GAQGTTTGCA GAO\GGCTGA TGTTGGCTTC CTUTAQGAGG 
23351 CTGGOGGGCr GTAGAQGAQG G6TX5CTX3GCC CCTCTGCCTG GCCCTQQGGA 
23401 CTGrrOGCTC CrcrcCCAAG TQGCCCAQGC TCCCTGCAGC CATTGCTGGG 
23451 GCrCTXnGGC CfiGTCAOCAC TTTGTGAGTG CTTGTTCAGT GAGTAAQCAG 
23501 GGAONGGCTG GGGGGTGGAC CAOSGGAGAG GAACCGGCAT TGGC3GGAGGG 
23551 CrcCCTATOG TGAQOO^OaC CTGTOQGTTC ACOVCCTCCr AGGAQQGTCX: 
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23601 AGAAAAGCAG CTCCCCAAQC CnCTQCQCCT GGTCCTOVQC AGATCCACCT 
23651 TCrnCACTAT MTAAAAGCC AGTCraOGAT GCTMTAAGG CCRnXKTOG 

23701 AGnrcrACA caaaccpqca gagagaaaac ovGrcooGrc ctcaacgaca 

23751 GCXnGGTCCr QQGACAGCCA CnSCCTTCiCT CFOGCCCOGG AQQGAAQCTT 
23801 TQQQGAAQQG GCTOGTCQGA (>[ Ibl I IGCC COVOCCTGQC CTGCTCTCre 
23851 TQGAAQGCGC ACTCCCCAGA GQQGrGAGTC CCAQGCGCTG TCCQQGTGCC 
23901 TPQGCrTOVC GCrcrCACCA QGCCTGTCCG GGACCACCAT GnGGTTTCC 
23951 CGTGAQQCCr CCCTCTCATA AGAQOGCCCT TCAGAAQQGT CQQGACCCCT 
24001 GCTAGTOGAC AAQCTGACAT CTGCTOCCTG CTGGAQGnSG OTQCACCCA 
24051 GGGAGAGCCr CATAATCAGG TGGGGGGGCT GGGAGAGGGC TGGAGGTOX 
24101 AACPGCAGCT I I ICIGICAT CTCTTCAQQG AQGTGGmGC GGTraOOOGA 
24151 QGATTCrCTG AGCTCATCGA GGAATCTAQG CCCCTGATGC TQGAATTCTG 
24201 a TAGTG TAG GQQGAGAQQG GGCATATATA ATTTCACGTC CAAATQQQGA 
24251 CATTTTTCAG AGTCAAAQOG GAAGCCATTA ATAATTATQC CAGCAOGGCX: 
24301 QQGTCCQGTG GCTCAOQCCr GTAATCCCAG CACTTnOOGA QGCOGAGQCT 
24351 GGTGGATCAC AQQGTXZAQGA GATCGAGACC ATCCrOGCTA ACAOQGTGAA 
24401 ACCCCGTCrC TACTAAAAAT ACAAAAAATC AGCTGQGCGT G6TGG0QQGC 
24451 ACCTGGAGK: COVQCTACrC AQGAQGCTGA GGONQGAGAA TCGC3GTGAAC 
24501 CCQQGAQQO^ GAGOTQCAG TGAQGCAAGG TCAOiCCACr GOACTCCAGC 
24551 CTCQQGGACA GAGTGAGACT (DGGTCTCAAA AAATAATAAT TATTATQCCA 
24601 GCATQGTGQC TCATGCCTAT AATCCCAGCA OTTCQGAQG CCAAQQOVGG 
24651 Al IbLI IGAG GCXAGGAGIT CAAGACOVGC CTGGQCAACA TAGCAAGACC 
24701 CCATCTCTAA AAAAAAAAAA AA7TAGC0GG GGGraGTGGT QQGTGCCrEr 
24751 AGTCCCAGCA ACTCAGGAGG CTGAGGTXiGG AOGAI IGLM GAGTCTOGGA 
24801 QGTGGAQGTT GCAGTCAGCr GAGATTQCAC CACIGTACrc CAQCXTOGGT 
24851 GACAGAGCCA GACCCTCTCT CAAAAAAAAA AAAGAAAAAA AAGTAATAAT 
24901 AATrATGCO\ GGAO^GCAQG TQGAGQGACA CCIGGTCCTT CTGACTCAGA 
24951 GCCTGTGGTC CAGCACCCCC TAGIGGTOGA ACAAGCCAGA CACAQGATAA 
25001 QGATACATTT AGrGrCTAGT TTCTACCTGG CAAACAGAGT GACAAGATPG 
25051 GGCTTAATAC TTTGCAQCrA TAAAATTCTA GAATTCTGrc ACCCAAGTTT 
25101 AATTTCGGGT AGAGCTTTTT AAAAAAAAAA TAGAGATQ6A GTOTQCCAT 
25151 GTTGCCOW3G CTGGAGTAA ACTCCTQGCC TOVAGCXATT TQCCCACXTT 
25201 GGCCrCCGAA fiGTQOCfiCGT GATTACAQGC ATGAGOIAGC ACACCCAGGC 
25251 TCCACGTTTA ACnTGAAAG AAGATTTTAC TTCATCATCA AGTCCCAATA 
25301 TTTATCCTTG ATAGACTX3CT I IGbl I I 1 1 1 Gl I IGI I IGI TTTGAGATGG 
25351 AGTTTGACrC I IGI IGCCCA GGCTGGAGTG CAGTGGCGCA AACTCAACTC 
25401 ACreCAGTCr CCGOTCrCA CATTCAAGCA GilLICI IbC CTCAGCCTCC 
25451 CA AGTAGC re GGATTACAGG TQCATQCXTVC CAOZACAGGG GCTAAI 1 1 1 1 
25501 GTAI I I I lAT TAGAGAGGQG ATTTCACCAT TTTCQCCAGG CTCGTCrCAA 
25551 ACTCCTGACC TCAQGTAATC TGCCCACCTC AGCCTCCCAA AGTCCTCQGA 
25601 TTACAGGOGT GAGCCACTGT GCCOQGCCAT AGAGI 1 1 1 I I ATACTmOGG 
25651 ATAATTCTAG AAACTCAGTA GTAGAGnAA GTTQGAGrPQG TCLI I 1 1 lAA 
25701 AGATATCAAA AOOIATTTAC TCGnATTTT AAAAAGAGAC ATTTTQQGAG 
25751 GAAAACTAGA TATAGAAATC TGrTGAATAT GTGACAGAAT CCCAAGACTG 
25801 ATAGATQGAC TCTGCCCTCT GAACAAGGCA AAGAAAAATG CAAAATGAAA 
25851 GCCrcrCTAC CCAGATCTGC TGQGQGATGA CTGAGGTCAA CACAGAAGGC 
25901 CCrCAQGCGG GGCAGQGTQG CTCAOGCCTG CAATCCCAAC ACTTTAGGAG 
25951 GCTGAGGTGG ATQGATOGCT TGAGCCCAGG AGnTGAGAC CAGCX^GGGC 
26001 AACATQGTGA AACGLIGI 1 I TTATAGAGAT AAAAAAATAC AAAAATTAGC 
26051 TQGGCGTCGT GGCATGTGCC TGTAGrCTtA GCTACTCAGG AGGCTGAGGT 
26101 QQGAQ6ATCG CTTCAGCCTG GAAGGCAGAG GTTGCAATGA GCTGAGATTG 
26151 CACCACTGCA CTGCAGCCTG CACGACAGAG OGAGACGCTG TCTCAAAACA 
26201 ACAACAAAAC CACACACACA GAGAGAAGGC CXTPGATTAG GCTGATAGTT 
26251 GGAGGATCTA GGGAAGTCAG CRSGGTCAGA CTCTGAGCAG CTCCAGAGGC 
26301 GGTGaGQGA GGTTTAGACT TCATCTCTGG TCAATQQGQG GCCAGQGAGG 
26351 C3GTTGGQQGC TGAGACTQQG GGCTGAGAGA CCQQOW3GA GCAACTQCCG 
26401 IGATGTAGGG AGGCXIAGAGG GAGGCCAAGC TTGGGGCAGr GGGTCAAGGG 
26451 QGCTTTGAGA GATGTGGGAT TCAGATTCCT GTGTGTXnGA GGGAGAGTGT 
26501 CrcCCTGAGr GCATATTOG ACXOGAGGT aXTCTGTCX: CrQGTGTCGC 



FIGURE 31 



26551 CTGAACAQGA AAGAAfiGGTT TCGTCAAQGG GGCAAGAGCC CAGAQCTCCC 
26601 CX3G0CTCCAG GAAGAOGAGG CTGCATCCre AGCCCCTGCA TGCACCO«jG 
26651 GCCACCCQGC AGCACACTOV TCCCGCGCCT CCAGAQGCCC ACCCCCTCAT 
26701 GCAACAGCGG COCCOQCAQG CAQQQQQCnG QQGACTCCAG CCGCACTGCC 
26751 GGCCCrcCCX: CATGGTGCTG CATGACCTCC fiOSCAOXAC GTCCAQQGAC 
26801 AGACTGGAAT GTATCTXIATT TOGOGrcrPG GQQQCAGQQC TCCCAC]GA,GG 

26851 ccATCcrccr CI ICI iggac ctccttggcc tgacccattc tctcqqgaaa 

26901 COXjGTCCCC ATQGAGCCrC AGAAATGCCA CCCXSQCTOGT TCGCATQGCC 
26951 TQGGGCAGGA GGGVGAGGCA GGAGACCAAG ATGGCAGGTC GAGGCOVQGC 
27001 TTACXj<\0\AC GGAAGAGACC TCCGQCTX3QG GCGQQQOVQG CCTOQCn^ 
27051 CreCCACAQG CATATQGTOG AGAQQQQQGT ACCCTCCCCA COTOOGOTG 
27101 GTX5GCACCAG AGCTCTTCTC TATTOVGACG CTQGTATQCaG GGCTCDQGACC 
27151 C CTCACT GGG GAC AGGGCC A GTCmOGAGA ATTCTCATTC CI 1 1 1 1 IGI I 

27201 OTcrnTAcr migiiima aoctqqqqgt tgqqqgagag QcocrecrnG 

27251 QGAACATCrc AOGAGCTTTC CTACATCTTC Oal b bl I LO C fiOCACfiOOCC 
27301 AAGMTATTT GGCAGCCAAG TCGATQGAAC TAACTTTCCT GGACTCTCTT 
27351 TCGCATTOSG CGTrATCTCG AAAGTCGACT GAAOSGAATC AAGCTCreAG 
27401 CAGAQQCCPG AAGGQGAAGC ACCACATCCT CCCTQCCCAT CTCACTCTCT 
27451 CXXTTCATCA TGCGCCTAGA GCTGAQGCre GAGAAGACAC CAQQGCTCAC 
27501 TTnGACOGAG GQC3CATQGAC GC3GACAQQCC TGTXSGCCrre CQCATQCTGA 
27551 AATAACTGGA ACCOVQCCTC TCCTCCTACA COGGCCTACC OVTCTCQGCC 
27601 CAAGAQCTX3C ACTCACACTC CTACAACGAA GGACAAACTC TCCAQGTOQG 
27651 AQQGATCAC3G AGACAOVGAA CCTOGAQQQG TGrOOWDGCT GGCAGGTOGC 
27701 CrCTQGQQCA ATTQCCTXZAC CCTGAQGACA TCAQCAGTCA GCCreCTCAG 
27751 AGCXjOGGGTC CTCGAGQQCG TGGVGACACA QCrCTTCaSG AGCAGCCTTC 
27801 ACCTTCTCTC TQQGATCAGT GTCCDQGCTGG CCGACGPOGC ATTTCCTCAC 
27851 CGAATQCrCA TAGAQGrPGA OCCCCACAQG GTCACXjOVQG ACrC3QGAlCAC 
27901 TCCCCTGGAA ACATOGATOG AOW3GGCTT TTOGGCAOVG GrcrOGGTCr 
27951 CCTCTTCGAG GAGGGCTTGr TTQGAGAAQG GAQGCTXSGCT QGQGGAGAAA 
28001 CCCQGATCCC GCTX3CATCTC CGCGCCTCTG GGTGCATCTC GGGTCCTCAT 
28051 CTGrreCACA CAGCrCACrC GTATCTCCTIG CACTXiGrACA TGCATCTUTA 
28101 ATACAGTTTC TAOJrCTATT TAAQQCTAQG AQGCGAATGT GCCCCATrGT 

28151 cficroasTcc aujIiilkjC cc3QGcrccrc TQQQCTAAQG O^GTCro GC C 

28201 CX3AAGCTTAA fiAAGTTACTC GGTALIGI I I TTAAGAACAC TTTTATAGAG 
28251 TTAGTCGAAG GCAAGTTAAG AGCOVATCAC TGATCCCCAA GP b l I I C I I G 
28301 AGCATCTX5GT CTGOGGGGAC CACmWC GGACCOVCCC TTOGAAAGCT 
28351 CAGGGGTAQG CCQVGG TTGGG ATGCTOAOCC TGrCACTCAG G Lil l II U»I I 
28401 GG CATOS TPG II 1 1 IGAATG TAGCACAAQC GATCAGCAAA CTCTATAAGA 
28451 GTUTTTTAAA AATTAACTTC CCAGGAAGTG AGTTAAAAAC AATAAAAGCC 
28501 CI I ICI IbAG TTAVVAAGAA AAAAAAAAQG TTTCrGGGTA C AI I I I C I C C 
28551 ATCTO GATAT ACCI ICI I IC TCAGCAGCTC GAACAGCPOG C I I I G I I CA A 
28601 1 1 1 ICICGAA GOGTCTGAQG CACCCTAAGT CCCTGAQCAG GACAGraGTC 
28651 AGAAGTCGTX: TTCGGGGAGG GAGGGAGAGG GAAGGGCFGG CTCAGGAGGT 
28701 GACGQGGCTC 0\GrCCAQQG TACAQCTGAG GCTCCTOGQC QQGTCC3GTGG 

28751 ccAcrccrre ggaagaactg cctxjtttcac aggggctcag gatgccaagg 

28801 TCrGGTCCQG GTAQGAGCCA TAGCTCCTGC mTOGGGCA GAGGTCCCTG 
28851 TQGTCTCACA GGAGTGCCrG TGACACOVSC CCAGrGACCT CCCAT<XGOG 
28901 CTTAGaTTC GACACTGGTA CAGACTTTTC GGAGCCCACA CXiCIGMCC 
28951 OVTGGTACAG OCCTCCAQGG CAGGGACGAA AAGAGTCATC CTTAAQGrCA 
29001 CACAGCCCre AGCTPGAATC CAAGCTTTGC TAOTAAAAA TTGPGTGACC 
29051 TTTCGCAGGT OVT TCGAGGA GCCTCAGTTC CCTTATTCAT TTAATGOGAA 
29101 TGTTCCaJTG GGGIGI 1 1 ICi 1 1 IGI I IGI I TGAGAI 1 1 1 I TGAGACTTCC 
29151 TCTGTCACa: AQGCrOGAGT GOVATQGOVA GATCTGQGCT OOGCAACC 
29201 TCTTCCrCCT QGGTTCAAGC GATTCTCCTG CCTCAGCCTC CCAAGTAGCr 
29251 GGGACTACAG GTCCCCGCCA CCATGCCCGV GCTAAI I I I I TGTACmTA 
29301 GTAGAGACGG GGTTTCACCA ICI IGGCCAG GCTOGTCrrG AACrCCTCAC 
29351 CTCAQGreAT CTQCCCAaT CjOGCCTCCCA AAGTCCTQGG ATTACAQGC3G 
29401 T GAGCCA COG CXSCCOVGCTX: CXXATGGGGT TTCAATGCAA AOVATGCAAA 
29451 CXjII I ICJblC TQCrcrCACA CTACAACAGT GAACACAGAA GA L I IC I bKa 
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29501 ACOSGCTCQG OOaSGTOQCT CACjCCCTCTA ATCCCAGOVC TTTQQGAGQC 
29551 TCAQGAAQQC QGATCATGAG GrOSGAGATC GAGACCATCC TOGCTGACAC 
29601 GGTCAAACXDC COTCTCTACT AAAAATACA (SEQ ID NO: 3) 
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context: 



DNA 

Position 

2082 ATXKyVGAGrCO\QGGCAAGCAGQQQGAAQGGCATCAQGnX3GGCAT^ 
AAGCCTXiGGA(:AGAGATCGGGGTCrO\GGCTlGA(nxn'<^ 
TCTAGCOIAGGGTDVTCGCTCAAGGrcAGGGCTXmX^^ 

GCCACACAOTGACTTTCAGAACATXyNGTCrCAGGGGATOTC^^ I ICACCCCT 

CATTCCCCTCATTCCCATCCCCI IGCI I M I I I IGAAACCGAGTCrreCTOCATCACCCA 
[G,A] 

GOGGAGTOTACnGGClGTX^TCnXSQCrCACTO^CCrC^ 
T TOiCrro O CTCAGC iCTaXGACnVV^^ 

AAI 1 1 1 ItilAI 1 1 1 lAATAGAGACAGAGI I i I GCCATGTrTAGCCAQGCTAGrCTCGAACr 
TCrGACCrCAQGTWCCACCTXK:CTCX3QCCTCCOWVliTGCTGQGA 
CQ\a^TGTOGGGCXXArOOO LI I GI 1 1 I Cj AQVGAOGTCAATCAGGQVGGGCTOGCTOGA 

2748 CAAGGAGGGOXZCaGGAGGAGGGGGCnGGGTUrcAAC^^ 

CTAGTOTCTAQCrCAGCTrCCAQGAQGCAGnn-aKSAGTGCrCTGCTUTOV^^ 
ACrCATCACrCACAGQGCTGCATGCRniCTGQQGCrGAGCnGACCC^ 

TTccAGreaxoGGQCcrccAQGcrrcKXiccrOTaurcc^^ 

TTCTOXnX3CrrC(XlXnX3AAGCCAQCAGGCAGAAGT^^ 
C-.C,A] 

TAOXJAQGCCTAGAGGGCreGCATmSGCAGAACOGATST^^ 

GACAGAOTGAGrrTQQGTXJnGGCAATCCCGCjrAGAQa 

GGAGGreGGACCCACAGClGaxnGGCrACCmCCTGGTAGCC^ 

TCAAQccTTCTcra:cTrcrrTTCTCftQGrrcc^ 

TCACTGAA(K>\ATOGAQQQQGGTXXAGCrcriX^^ 

8868 TCCTlOTQQGrcrcAGrCCreGLI(3l ILIGIATLI I ICI ICItiCIGACil ICI lAGCCTAGC 

TTAGQGTIXKiG^CGGGGCTTC^GAGATCrrGQGMQGAAQGGATTTATGTCC^^ 
GQGAGAGraOTCCTGGCATCQQQCOQQQQCAraGTXXiCAQQGira^^ 
QQGGCTACnxnWAAGAQCTOWSGAGQGTT^TCn^^ 
TCATTAGCTACATTO^CTnQCOGQQQGCATAQQQCntyWNG^ 
[G.A] 

GCCTACnXSGGAQGGAOVQGAATCTAGGCAGGCAGATAATACAGCGTGGT^ 
OGQGGAGCOAGAQGaXJrAGGAGTGCaiGGGGGCTCGGGAAGTCTC^ 
ACmTOVrrmSGTCaGAQQGATOVGTAGA^^ 
0GTGGAAGAGGAlTnGCCTT3GCATAGGanCTACrCTGAGTC^^ 
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GGCTCAAGCXlXDCAGTCGGCAGAOGAGGGGGGIGbl lAGTCAAAGATSATQCFQGAAACA 

10187 TrCCCTGQQQGAACAQGCTQQGGATCreGCTOVQGrrTCACC^^ 
AaTGTCrrTATCCreCTGGAGCCTCTCCTT^ 
ATGGGGAAACTCAGGCAO^CAGAGGGGAGGGGTTCOMSAAM^ 
GAGGG^TCGGATQGGMGGGGA(^TTTTACXXDQGGTAO^^ 
[G,A] 

GTCCrcrACCAGCTroGCrCCACCTCCI I ILIGI IGIAGTCCLI ILI I ICCCCTGAAGTC 
Lib! IGlLIGCTATCCCCTAQCCrCO^CAAAGAAACJGAmTATCmCLIUal ILI IbG 

AGGQCCAGQQGCTQGA<7^QGreGATCMCAGAGG(:ACGAQQGC)^^ 
TWTCKXXiCCCAAGAAGAGAGAGrrcAQGCreCC^^ 

10216 TTCCCTX3QQGGAAOmJRX3GGATaX3GCRyM^^ 

ALI G I L I I l A TCCTCCrOGAGCC R- I LL I I Ili COTCAGATTAAGACr L I L I I HaC ACAT 

ATOjQGAAACTCAOKACACAGAQQQGAQQQCrrreCAGAAAATCCC^^ 

GAGGCATCQGATQQGAAGQQGACATTTTACCCQQGTAaS^ 

GTACCAQCTTOQCTCCAOCrccr 

[ T,C] 

TLIGI IGIAGTXXLI ILI I ICCOCPGAAtalLCICil IGlLIGCTATCGCCTAajCTCCACA 
AACWVACXSAGrrTATCrrACLIGb! I CI IGQGrAAAQCCTCATCAQGACCCAQCTAATCA 
CAGreAAQQGCTrCCXnX3QQGCAGAAa3GTTAGC3GCOVQQQQa^^ 
AGAGGCACiGAQGGGGCTGAAGACCrGCCI IGlCiATTCrGGCCCCAAGAAGAGAGAGniGA 
QQCTCCl^TGAGAQQQCrGQ(nX3OTO«QQCX3GCCO^ 

moo ATay\TGCcrAGaTx:crAGmTcrACCAAQ\TOTCT^^ 

GAGOTiGWGnXZAAQQGAGAQQCrAQQGnTCAGATAAGA^ 

GTCAAATSTCCAiOOCaxreAG LI I LKjIfa COCTCAAGGGreTCTCAGATTC C I Iblb l b 
TCCCTCGCACATAGTAQGCAATCAAGAAAGrXSCCALIWal 1 1 lATQGnAI Ibl lATACG 

GCAcaxKicrirrcreccaxLAGCCTCCCTcrccrcTTCTCccTTCcrci i ici icrcrc 

[G.A] 

ccTTaoiccrcccr<rrCT<XAGOKTCCTQQQGr<XGrr^^ 

COVAGGAGGGAGGCCACAGGTCGATCGrcOGGGGAlXXSGGTC^^ 
CTTXKaCTOGGACCrrACCAGrCATCrCAGCTTCAGC 
QQGCCD\GAAAQOVQQQCAGACax:CAGCCTATTAGCTCATT^ 
CTACrATATCTOQCrcAClCmOVCXrOVQC^ 

11650 GTOGCTGACXTrACACCCrAGCIGIGICATCC.IU.1 IGlCGG^AGSXTSGGGRiCCATC 
CATOO'CRyWVCCCCATCAGCCCAGATCCaSAQQGaxyW^^ 
AGCAGAGTCCCTACAATCmCTCTG^GTCCC^GCAGCAGQGACATCm^ 
GGTXmK3ATC5GAACn3GAGAAAGGTTTTGATTTQCXTTTma:C^^ 
AGQGAAGGGGO\GAG(TAGCCrAGGAAGAACraGCrCCO^GaXXmX3Q^ 
[G,A] 

CaXjrAATCCCAGCACrmXX^QGCOGAQGTOQGni^^^ 
ACyVCOVGCCreACCAACATCGOSAAACCCTXjrcrcrACTAAAAATACAAAAAGr^ 
GClGTCGTlQCX^VGACACCrGTAATCCCAACrACTCXSQGAGQCl^QGC^^ 
GAAOCTGGGASGro^^ 

11656 GACaTACACCC(:AGCrcTGrCATCaGCrRn"CCCAAGGCCrQQGGTGC(^^ 

TCTCAAACCCCATCAGCCCAGATCCOGAQQGCKyVGATGGrACCrCTUrAQGAT^^ 

GTQXTAOVVTCTrACrCTCAGrOXAQCAGOVQQGAC^^ 

GGATCGAACTGGAGAAAQGnTTCATTGGCrrTQGGCX^ 

GOiGCAGAGCTAGGCrAGGAAGAACIOGCTCCI^^ 

[A.C] 

ATCCC^GC^OTKKiGAQKCGAGGTXSanXjGATCACC^^ 
GCOCAGCAACATCOGGAAACCaUTCTCTACTAAAAATAOW^^^ 
TCGCAGAOVCCrerAATCCCAACTACrCQGGAGGCTXSAQGCAGGAGAATCT 
GGGAG(nX3GAGGCTGOVGTGAGCX3GAGATCAGGClCATTQCACT 
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GTXy\GftCRTCTO'CAAAAA/W W\AA AA A AMCCAAA/WWW\AC^^ 



15095 GGCTCrmXMTATCAQGAAGAAGGraGAmTCAO 

CXjGAQCIObI I l/VGTGAGCAGAAGrrCCATATATACCATC^TTCaii^^ 

AAOGTACLIOil I AGACTCTTTCCCAAQGAAACCCAGAACCCaGGAQQCjrayNGGCCTT 

GrrcnxK:cGCCCC7xnxn"CCTO\GCAcro\GCAaxm:ccAG(^^ 

ICTl 

GGAGTUrrnnCGAGTXSAGTAOVTGAC^^ 

TTTCriTCCAQGCaTXsGATtnATCQGCCACro^^ 
GGAAGGTXyNGreCCAGGGA TGCCAGC AGAGCroGGGOGGGrCQVCn^ 
a^NQ^GAlXXIAQGCrcnGLI 1 1 1 KjICXnTAAGPGGLI 1 1 IGAAAGAGODOVCOQGCr 

15116 GAAGGraSATTATCAGGAGAAGGGATCAOnTTGAAGCCaS ^ I l A gTCAGCAG 
AAGrrcCATATATACCATCATTCCreOOGTCajrCTCrcGCACQ^ 
CCCTOXSGATGAAQGAQtil 1 1 ICjIACLIGI IGAGnOGAAACGTACCIGCjl lAGAGTCTT 
TCC(^GGAMCC0^GAACCCCn3GAGQGTOGAQGCC.I I (a I ICIbGCCGCCCCTCrcTCC 
TCAGOOOVQCACQQQGC<XAQCATCX3QQCAACTACGQCX^ 
[A.G] 

CATCACAGAQGAMGAGGTTCCaXKiAQGCCTO-CCTXsCAGCCas^^ 
CAGAGGTCGCrereCCI C jl I GG GGACTGATgroAGC A I G I I I L I 1 1 C CAGGCaTGGATG 
TATOGGCCACnOGCCTCAOLiI IGIACTQCI I HjICrATQQGAAQGTGAGreCCAQQGATG 

TTTTGreCTTAAGreOCrrTTCAAAGAGCCX 

15428 AAAGAQGrrCCCKKAGGCCTCTCaTSOVQCCOQCTQGA^ 

GTQCLKjI KiGOGACrGATCTGAGCAIbl I ILI 1 1 CCAQGCCTraGATGTATQQQCD^Cr 
QQGCTCACGrrcrALIlbLI I IC3lCTATT3QGAAQ(nX3/«nX5CCAQQGATCCCAGO\GAQ^ 
GGGGO SGCniCD Vgre^VSGOGGGO^OGGGCGAOGGATSO^^ 1 1 1 IblC CTTA 

AGreGCriTTGAAAGAQCCiCAlC3CTCQCR^ 
[A.G] 

AAQQQGAQGrreCCGAC3GATQ(3ltil IGCCACCXjCCCCAGCCAGAGTGCTCCCCJGTCCCrcr 

GrcccrrGAaa:(:ATTCACTrATreAGCc:A ig i g 1 1 CAcrcccrrccrcATTTATTOGAC 

AMTTOTCOTCACCCCrACCCraSCTOVGGCTCGACCaX^^ 
TATC3QCnWCCCTX5CCC3QO«3GrcTlK:CTX5Crcra^^ 

15827 CCAGAGTCCTCCCOCTCCCTCrcrCCCnGACECCATTCACT^^^ 
CrCC(XrXCTCATTTATTC]GACAAATTX;rCCrrCACCCCrACC<^^ 
CFQQQGACACCCAACGCreAajrATGQGreATCCaXKiCCGCAGGrGreca^ 
GACO^CACrAAQQQGCAQQQQQGAATTTCAGreAACATCTTCCCAAQCCCCA^ 

[T.C] 

TQGCCXKIQQOTSGCAGTAACGGGQGraGATOIACCCAGCCrCAT^^ 
CATTCATTCACTOWCAI I IGI IGA^(:ATTTAO^G^UTlGAG^t]AGG^CaTC^CATGTA 
ATCGGAGCX:CAGACCreCX:CCCrAC0CCTXKX:ClCOVCO\AQQ» 
QGCAOVQGra3M3G(XCraGA(XOVO^^ 
TTCATOGACXSATTTOmxraOCCaOZAOV^^ 

16135 aKnGGCAGTAAGOGCGGTOGATQCACCCAGCCTCATTCrrTCCCT 

CCCAGACCreCCCCCTACCCaimXICAOOWiQGA^^ 

CTCGAQGCCCTGGACCCAOVrCCI I IGlCIGCCTCTCCACCCCACAGreCCanTCATOG 

ACGATTTC^TCCTQGCCCrCCACAQGAAGATCAAGMT^^GCCCGTGGT^^ 

[G.A] 

GTTGOCRXiGAGGGACXCCTXSAATAGATGGAGGGGAGGGA 
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a^CXnXZAOCTGTCGATQQGMCanXSAGATCCAQQGrT^ 
CAKATTCA(^(:ACrcCTrCACCXK>VQGCCT«:CTQQQGrC^^ 

16557 TTCCXTOiGAQQGACCCCreMTAGATQGACQQGAGQGACrcr^^ 

ATCArn^C^Oj^CrcaTCACCGCAGGCaXKlCTCQQGrCAGT^ 
AGCCOTOGACrCAAQQCXSAACnCTCCrrCCCCCACrCAGGOT 

[A,G] 

QQQQCnSQGACTOVAQCX^OaTXXlXXCOVCTCnm^^^ 
QCCTAGCAGQGATCTXXQQGAQGCCTCrntCTCA^ 
GGTCQCAGCCACAQCCreAQCTCaZAGQCTCCTC^^ 
MTCGAGGGGQGGTGGQGTX3GCrO\a;GCTCTMT 

17375 Gf43aOCM3Cfit3GGC3aSfiaCfiG^^ 

AMCCCCATCTCTACTAAAMTAGAAAAAAMTTAGCCGQGCGTCG^^ 

ACTCCCAGCTACTCGAGAQGCreAQCXIAQGAGAAaXSGGrcAACCC^ 

TOVGTCAGCTCAGACCGTCCrAaTSCACTCCAQCC^^ 

Q\AAAAAAAAAAAAAAAAAAAAAAACMAAAA/WW\ACAAAAAACAACAAAOW\AAAA 
[C.T.G] 

AAAATCAMaSGGAOTXHACTCACKlGACrcaXsCTCT 

TCCCCAAGCCCTGCre^GCCCTCCTCTrCCClCTCraXSGGC L I IG CCACnATACC 
CCrraCCTCATTCAQQCCTCAQGCCCCTCCCCAGACmTCTAGCC^Ca^^ 

AAAGCAO^AQWVTCroOCrGCAT L I L I Ki CCrGAA Li I I I CilC I (» AAC5LI ICICAAAGOC 

17375 GAGGCCAAGGAQQQGQGATCACjGAQGrCAQGAGATOGAGACCATCaT^ 
AAACaZCATaXTACTAAAAATAGA/WWWVTTAGCOKjQCXni^^ 
AGrCXX:AGCrACTX:GAGAGGCTXyVGGCAGGAG^\C)GGanX^ 
TCACniyNQCreAGA(XGT«TACTTXy^CTCCAQC 

CAAAAAAAAAAAAAAAAAAAAAAAAOVAAAAAAAAAMCAAAAAACAACAAACAAAAAAA 
[G,C3 

TCCCCAAGCCCTGCTXSAGCCCTCCrcrrcCCTUrCTCTXSGGCCI lb I IGCCAOTATACC 

ccrraccrcATTCAGGCcrcAQGCcccrccccAGAcrrATcr^ 

CrCGCreCTQCTXXXiCTCCCTCCAGrCCAGCCAACACATTCAQGCG^^ 
AAf^QCACAACAMTCTGCaXXATCTOTOiaX^ 

17375 GAQGCCAAGGAQGGOKSATCACGAQGTUNQGAGATCGAGACCATCCreGCT^ 

AAACCCCATCTCTAOAAAAATAGAAAAAAAATTAGCaSGGCGrXXnimmiGCC^ 
AGrCCOVGCrACT03AGAGGCTCAQGCAGGAGAACGGCGTXy\ACCC]QQGAQQ0\GA^^ 
TC^GRyVGOGAGACGGnKTACrcCACT^ 

[A.C.G] 

AAAATCAAACQQGACnxnACr(:AGCGACTCCTGCrCr<_l ILIGCI lATTTCCTCTXmQG 
TCCCCAAGCCCTGCTGAGCCCrcaOTCCCrcrcrcreGGCLI IGI IGCCACTTATACC 
<XTreCCT(^TTC^QQCCTOVQQCCCCrCCCCAGACTrATCr^ 
CTGQCreCTGCTOQCCTCCriXX^VGTa^^^^ 

AAAGCACAACAAATCTCOCroQVT L ICI I b OCTCAA fa l I Ifa l C I Ci AAGLI ICICAAAGOC 

17375 GAQGCCAAQGAGGGGQGATG^GGAQOT<:AGGAGATCGAGACO^TCCra^ 
AAACaiCATCTCrACTAAAAATAGAAAAAAMTTAQCCQQGCGK^^ 
AGKXXAGCrAaX3GAGAGGCTGAGGO\GGAGAAG^^ 
TCAGreAGCTGAGACCGrecrACTCO^CrCCAQCCreGGa^ 
(:A^^^^AAAA^^^^AA^AAAA/W^ACA'\AAAAAAAAM<^ 
[A.C.G] 

AAAATCAAAOQQGACTTCrACTCAQGGACTCCrecrCR.1 ICIGCI lATTTCCTCrcrGG 
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CrCXXnXXTCCTCQCCTCCCrCOWJrcCAGCCMCACATTCAGGCGQQG^ 
AAAQCAOVACAAATOXKICTXXyaCTCnxXICTX^ 

17511 GAGGCTCAGGO\GGAGAAC]GGQnT]MCCX]GGGAGGO\GAa 

AAAAAAAAACAAAAAAAAAAAACAAAAAACAACAAACAAAAAAAGAAAATGAAAGGGGAC 

TreTAcrcAQGGAcrccTxscrcrci icigli iATrTccrereTxxn"GCC0W5ci:crccT 
GAGCCcrccrcrrcccrcrcrcraQQCx.1 igi iw:cAcmTACcccrnQccrcATTO« 

[C,T,G] 

(XTCAGQCCCCTaXOVGACmTCrAQCX^VaTrCO^ 
TCCCTCCAGTOCAGCCAACACATTCAGGCQQQGACAQCCCre^^^ 
GCaiXATCraTGCCrcAAlbl l IGlCICWVOCrrCTCAAAGCCACACCCRSQCGCTAGC 
ATTCACACOTarCCQQb 1 1 C I C/ICACCCDGCTCXn^CraGGGCCGCCTXiACTCCCTTTCCCG 
AGCACG«3CCAQCTX3QLI ICIGICCAnTCCrcCrCATa.l(3lW3l KjOGTCOCrcCCT 

17928 ATOXKiCRXIATCrcrnGCCrcAAtil I IGlLICaAAGCTTCTCAAAQCCACACCCrQGCXjC 
TAGCATTCACACCTCrCaSQb I I L I (jCCACCClGCTCGrcrQQQGCOQCCTCACrCCCnT 
CCCGAGCACCAGCCAGCrOGL I I C I G I CCATTTCCTCCTCATCC I G I I I (iCCTTCCCr 

(XCTKCTCCAOVGnOTACcxcreGnKicrcrcr^ 

TCTCrrrCiCCaXyVGCCCAQGAATTTTAA^ 
[A.C] 

CC}GAAGrO\ACAGTCCTCAGAGTQGLI I ICIGCTCCCTACTCTTGGAAQGATQQGCTCCC 

C^GACCACTGGTXiQOWVGAAACCreGGGriTQQCCGQGGGTXKn^^ 

ATCCOVGCACTTTGGGAGGGCAAQGCAGGGGGATCAlXyV^^ 

cjoacrAfia^asGTGMAaxLCGrcrcr 
17968 TCfiAfiocaKa>cc(n<3aoxTAB^ 

GQQQCCGCCrt^CTCCCrTTCCCTGAQCACCAGCCAGCnQGL 1 1 L I G I CCATrTCCTCCrC 

ATCLiGiGGi iGCcrrccCTCcaxxxTCCAOVGnxnAcccaxsGroccTor^ 

CTATACCCCCnGCTGAQGGGIGILI I ICCCCTCAGCCCAGGAATTTTAAAAQQGATGAAG 
C^TCTAAGACAACAGGGGGMOGGAAGTCMCAGrCCTGAGACnY^ I ICIGCTCCCTA 
[C.G] 

TCnGGAAQGATCGGCTCCCCAAGACCACTXiGTQGCAAAGAAACCTQQQGrn^ 
OJraGTCGCTCACGCCTCrAATCCOWX^C^^ 

ATCAQGAGATOGAGATCATCCTQGCTAACAaSGTGAMCCCCGrcrCTACr^^ 

AAAAATTAGCOXMG^ClQGroQOQQQCACCTGrAGrCCCAGCTACrOQQGA^^ 

AQGAGAATCGCATGAAGCniGGAGGGGGAGCnGCAGT^ 

18574 GCam3CCAO\GAQCGAGACTCCATCrCAAAAAAAAAAAAGAACCCTX3^^ 
GAGAGAGTTQGAGOGATCnGGCECrcAGQGQGaGCTCCCrCCC^^ 
CrcraCCTGO«TCTTCTGQCTCQO\CTXnXX:0«K^^ 

TTCOCXnOXnXIAAAlXyVGAGM^GATLI ICICIGGGrcrAGAAAAGGAOGAGGrGGTGA 

GraQGTCTGAAQQCCTCTGGTGTaiCATAAAQCXy^CrCT^ 

[T.C] 

TCGQGlTtn-CCAGCACCCATQGMCrCTGrCTXnGCCTCTGrca^^ 
TCCTX3CrCAGGAAAQQCrcrcO\AQGCCI IGI IGTXSAMTTGCrQCCTGCTGTCCDQGAAC 
TOOCrrcCCATCCJGAGGGAGGAAGGI I KJGGGAAGAGAGGTGGACAGGAAGGGGrGCr 
0\TCAGClQGrCCCACCCTCCTCrCCrrcaTCGCCCrCTCCAQGCC^GAAATOVG^ 
AGCrCAAGGACCTGATCCTGAAGATCmGACAAGAATCCCGAGACGAGAAT^^ 

19654 AGAGGGAGGTCrrerGGTOGGGAGACX^^GGAGGLI IGGIGAGGAGAGreAC TCATTTA AA 

GGCONAGGOQQGCAGATCAGGAGGrCAGGAGATCGAGACCATCaTGAAACCC^ 
ACTAAAAATATAGAAAATTAGCTX5QQGG I GG I GGCQOGCjOGGrrcTAGTCCCAGCrACrCE 
GGAGGGlGAGGCAGGAGAATGGnnGAAGGGQGGAGGn^GAGTTIT^^ 
[C.T] 



FIGURE 3P 



GGAQGCClGAGCmXnXiGATCAGGAQGTO^GGAGATCGAGACCATCCr^^ 

GAAACCCCACCrOACTAAAMTACAAAAAMTTAGCaKKSCGCAGTX^^ 

ACnx:GO\GCrACTGGGGAGGCC3GAGGCAGGAGAAlX]GCTTX;AAC^^ 

21498 QGnrOQCTQGAAa:AGGO\Aam^VOQQQGTXXCCAT^ 

C(ArCTCCTOCTCrOVGACCy^aT5GAAC^CCTCC(^a3^^ 
CACTCCOXntrCCCCTCGTCAATTKnDVACTCAGO^ 

GTCTCTCACCTCCCAGCTCAGAGC C I GLI I CT CACrOGGGCT G CrSTUT C CCrCAGAATC 
AGACCrCQVGOCTCTXaCCCCACCAOOQGCC CIb l 1 1 1 I L I ia CXiGGGCrCgreCAOCXJrOC 
[C.T] 

OTCQcnc ccrcrcx icoasGSfiGTCco^ 

TTCOTCGGLI 1 1 1 IAOGCO«3Q(JrCCCCTC<TAGAGAGMGCX5AGCCrrCCaX3ACCCr 
CTAGCmCAGCCTCiOCLKiLI ICAQVCCTCATGGCCArTCC LI 1 1 lAM 1 1 1 IC CTT 
TOOVCmCR^CATACATAATrrACTC A i M I I C I IC I I l A CmTOGCCTCTXnXXXC 

22729 GCCCreCftOTAa^TCnQGCATCTTCTaSGCQQCCQGAAQ 

ATCCCrCCATQQCATTOnTGCGOTAaDGAGAAQGTATCTrcr^^ 

GC<XraK:CCTGCClCACTaTTCAGCCTOQGCrnQCX3GA^^ 

CCCCCrccrCATCrACC^OQQCATXIX3GCTOTO\a:AT^ 

[G.A] 

CACTQCjQKCCCCnXSQCAGCraTCrCQQQGMTCDGGAGCAG^ 

GccraoGAcarnxscrccrcAciGcrccCTTurc 

TCAQGAAQOJrrCCTrreQGMCCClGTTnGAGCCCCAA^ 
TOTCnKTCCAQGAMCCTACTCGKnAACnACrQGTXiQK 
CCTXXy\GrrQQCJraQQGAQCTXnXWQQCCCATCCrca^O"C^^ 

22757 CQGOQGCCQGAAGGaSQGAAQGATQQCAO^GCTaXICCTCCATQGCAT^ 
AGAAQtnATCrrCTAATCGACrCCCACTTCCAGCCaXSGCCCTCCCCACT 

QQCc^^Ga3GACcc^^o^TOX5CTCGrcccQGCccccTCC^(:^^ 

GCrCCrCAOZATTCCAQGAATATXmiCCAGCTXKX^VQGQ^ 

CATmATTXTCranXSATTGAGATOQG^ 

[C.T] 

GQGGMTCGGAQCAGAQQCTCOCTXn'amSAGCCTGQGACCTC^^ 

TTGTCCrCCrCAGATCCTCGTTGMGrCC^TGCTXyVQGAAQCXJrrC 

TCAGCrOAAQCAaSGAQQGAAGAGOGATCXZATCraXKTCOVQGAAACC^^ 

AGrAOGGIXiGGCX^VQGGACrcCCDQQGCAaXDt^^ 

OATCiCrcCCACTOXIACnn-OCTTQQGCONAQ^^ 

22779 ATQGCA(^(KATCCCTCCATQGCATFX5aXSCan7VGCGAGAAGG^^ 

cccACTTccAQcraxmicrccxcAcraTTOVGca^ 

CnSGTXXCJQQClCaXTaTCATXnACCAGr^^ 

TGCCaZCAGCnGCCAGCXSCCCCbKjl ICI IGCCTaXX:CATTTG\TCCrcrecrGATTGA 

GATQQGACCCQCACTXXIQKCCCCrraGCAGCTQCrcraS^^ 

[C,T] 

GrGTaX3QGAGCCraQGACaOTQCTCCr<^axnT^^ 

AAGTCCATCCreAQGMGCXsl I LCI I IGQGAACCGGTTTCAQCXXCAAQCACIQGAQQGAA 

GAGCGATCCATCTCreCrCCAQGAAACCrACTXjGTirrAAOT^ 

COiQQOVCTCCCTXSGAGnTQQGTXjQQGAQCTaXSAQGCCCATCCTCC^ 

24350 AGQGAGAGCOX^TAATGAQGTOQQQQGCCrmSAGA^ 
TTTTCrcrCATCTCrrCAQQGAQGTXKSnTXKiGCTTOm^ 

A GGAATC rAGGCCCCreATGCTGGAAl HjKjCI IAGTOTAGGGOGAGAGGGGGCATATAT 

AATTTGACGTCCAMTtmSACATTTTTCAGAGTGAMGQQGAAGCCA^^ 

CCAGC^C3QQCiQQQGreGQGTXK5CTO\C3GCXnTnAATGCiC^^ 

[T,G] 

GGTCGATCACAGGCnXZAGGAGATGGAGAO^lC 
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TACTAAAMTACAAAAMTCAGCrXXmnXiGreGamiACC^^ 
AQGAQCKnxy^QGOVQGAGMTQGCGTCMCCCGQGAQGCAGAGCrrcC^ 
TCACGCCAaiQCACrCCAGCawmy^CAGAGlXykGACrCOT 
TATTATCCCAQCATQGraQCTC^TmTATAATCXXyVQCA^ 

24558 GAGAGTX3AAAQQQGAAGCCATTAATAATTATQCCAGCACQGCC3Qa 
CaxnVV\TCCO^GCACTIT«5GAQQC0G^ 

ACC^TCCTCGCTAACAajGreAAACCCaJTCTCTACrAAAAATACAA^ 
GCTGG'IGGGGGGCACCllGGAGTOXAGCrAaXIAGGAGGCrcAQS 

[T.C.A] 

OVGAGlXyVGACTCOn'CTOWVAAATAATAATTATTATGCCAGOXT^^ 
ATAATCCCAQGVCrTTQGGAGGCCAAGGCAQGAI ICjLI KaAQQCCAGGAGTTCAAGACCA 
GCaYiGGCAA£ATAGCAAGACm:AlXn<TAAAAAAAAAAA^M^ 
GTGG(niGCXnxnVVGTXDCDVXAACIXy\GGAG^ 
GAGCTGGAGGTRiCAGrcAGCrcAGATTCCA^ 

24872 CAAGACCCCATCTCrAAAAAAAAAAAAAATTAGCCQQQCGTOGlXX^^ 

CCOVGCAt^CrCAGGAGGCTXiAGGrGGGAGGAI IGAGrcrGGGAGGTXSGAGCTTGCA 
GTCAGCTGAGATTGCAtXACTtnVVCrmVGCCTT^^ I C I C 

[-.A] 

AAAAAAAAAAAGAAAAAAAAGTAATAATAATTATXXIG^GGAOVGCAQGTC^ 
TCCTCCI ILICjACrCAGAGCCrcreGrCCAGCACCCCCrAGTCGTXXiAAON^ 
CAQGATAAGGATACATTTAGrerCTAbl I ICilACCTXiGCAAACAGAGreAO^AGATnQQG 
CT TAATAC TITCCAGCTATAAAATTCTAGAATTa^^ 
AGCrriTTAAAAAAAAAATAGAGATCGAGTCnQGCATCT^^ 

25756 AGCTCQGATTAOVQGraiATCCCACO^COM^W^^ 

AGQQGATTTCACCATTTTOK:<^QGCran"CTO\AACTCa^ 
ACCTCA GCCTCCO^AAGrocroGGATTAQVtmnX^ 
TTTTTATACrrTOSGATAATTCTAGAAACTOVt^^ 
TTTAAAGATATCAAAACCXATmCraCJrrATT^^ 

[c.r] 

TAGATATAGMATCTXjrreAATATXnGACAGAATCCO\AGAC^ 
CTCTCAACAAQGOWVGAAAAATQCyWVATXyWXGCCrCTC^^ 
ATCAOXSAGCTCAACAOVGAAQGCCCrCAQGCCGGGCACGGRSG^ 
OWZACmAGGAGGCreAG CnOSATC GATOGCmSAGCCG^^ 
GGGCAACATOGrcAAAGCLIbi 1 1 1 lATAGAGATAAAAAAATACAAAAATTAGCFGGGGG 

25968 GrAGTAGAGrrAA GrGGAG rrGGTC L I I I I l A AAGATATCAAAACCGMTTACI G b l I A T 
TTTAAAAAGAQ\CATTTTCQGAGGAAAACTAGATATAGAAATCTCn^^ 
AATCGCAAGACTX;ATAGATCGACrCrcCCaxnX^(:A(\^^ 

CCQQGCAOQGnRQGaOVGQCCnXTVnm^N^^ 

[G.A] 

CnGAGCCCAGGAGTITGAGACCAGCCTGGGO^ACATCGTiGAAACC C I Ci l 1 1 I lA TAGAG 

ATAAAAAAATACAAAAATTAQCTXaOGGCnTQGreQG^TXJTX^ 

QGAQQaX3AQCTQQGAQGATC3QCrreAC3CCT«W^ 

TTXACCAaa:ACre(:AGCCKKAGGA(:AGAQa5AGACGCrGrCT 

ACCACACACACAGAGAGAAQGCCaTWTAQGaWAGTTXjGAGGAT^^ 

26537 TTAQQaWAS^ra;AGGA^X^7^QQGAAG^O^^ 
AGGCCXnX5Cra3GAQGmAGACTTCATOOr5GT^ 
QQGCTGAGAOmiQGCTCAGAGACCiSXAAQGAGGV^CT^ 
GAQQGAQGCCAAGCn-GQQGO«JreQGTCAAQQQQGCrnGAGAGATCT^ 
TCCTUTGIGIGIGAGQGAGAGTOTaXXCTCAGTCCATAT^^ 
[G.C] 

TCCCniGIGICXXrrcAACAGGAAAGAAGGbl I IGGIGAAQGGGGCAAGAGCCX>VGAGCr 
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QQ0\GGVO\CTO\TC03QGQCCra^VGAQQCCiCACCCCCT^^ 

IXXZAGXAClGCAGCTlGCyVGGGACAGACrcGMTC^^^ 

28204 CCroGAAACATCGATGGACAAGGG L I 1 1 lfa G0CAO\GGreroGCjlfal ( jLlbl IfaGAGGAG 
QQLI Ibl I IGGAGAAGQGAQGCTtSGCTtmSGAGAMCCCGGATCCOQCTO^^ 

GcaxnxaQGreovrcraaanxKTOvrciGi itaC^c^cAGcrcACTCGrATCTCCTGCAC 

TCGTACATOZATOTnMTAOVGTITCTAanXTA^ 

ccArnn-cftGreoGTCOvasTTrcrccoo^^ 

[C,T,A,G] 

GCTTAAAAAGrrACTOQCnALIGI I I I lAAGMCACTTTTATAGAGrrAlGTCGA/VQGCM 
GrrAAGAGCG\ATO\CreATCCCCAAL,IGI I ILI IGAGCATCreGTCTQQGQQGACCACT 
TTGATCG GACCCA CCCTTGGAAAQnCAGGGGTACXjCCC^GGTGGGATGCrCAC^ 
ACreAfiGG nrrgg rreGCATObl IGI 1 1 1 IGAATXnAGCAQVAGOGAT^yNGCfV^ACrcr 
ATAAGAGTCTITrAAAAATTAACTTtXCAQGAAG^^ 
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